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A MATHEMATICAL ANALYSIS OF THE EVOLUTIONARY 
BENEFITS OF SEXUAL REPRODUCTION 

ANDREW E.M. LEWIS-PYEi & ANTONIO MONTALBAN^ 


Abstract. The question as to why most higher organisms reproduce sexually has re¬ 
mained open despite extensive research, and has been called “the queen of problems in 
evolutionary biology”. Theories dating back to Weismann have suggested that the key 
must lie in the creation of increased variability in offspring, causing enhanced response 
to selection. Rigorously quantifying the effects of assorted mechanisms which might 
lead to such increased variability, and establishing that these beneficial effects outweigh 
the immediate costs of sexual reproduction has, however, proved problematic. Here 
we introduce an approach which does not focus on particular mechanisms influencing 
factors such as the fixation of beneficial mutants or the ability of populations to deal 
with deleterious mutations, but rather tracks the entire distribution of a population of 
genotypes as it moves across vast fitness landscapes. In this setting simulations now 
show sex robustly outperforming asex across a broad spectrum of finite or infinite pop¬ 
ulation models. Concentrating on the additive infinite populations model, we are able 
to give a rigorous mathematical proof establishing that sexual reproduction acts as a 
more efficient optimiser of mean fitness, thereby solving the problem for this model. 

Some of the key features of this analysis carry through to the finite populations case. 

Sexual propagation must certainly confer immense benefits on those populations un¬ 
dergoing it, given that sex involves substantial costs such as the breaking down of 
favourable gene combinations established by past selection. Hypotheses as to the form 
these advantages take fall naturally into two groups.^^ On the one hand a function 
of sexual reproduction and meiotic recombination may be in providing immediate and 
physiological benehts, such as allowing repair of double strand DNA damageBlUl Such 
mechanisms alone, however, are unlikely to account for the continued prevalence of 
sexual reproduct ion |300 and so, on the other hand, decades of research have seen evo¬ 
lutionary biologists looking to develop explicit theoretical models which explain the 
advantages of sex in terms of the interaction between variation and selection. Many 
of these model^®^ focus on ideas originally due to Morgan,^ Fishei^ and Muller,^ 
which stress the ability of recombination to place benehcial mutations together on the 
same chromosome. In a similar vein one may also consider the build up of deleterious 
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mutationsHHm Sex has been shown to be favoured in certain models which allow for fluc¬ 
tuating environmentd^^^ (e.g. cycling between negative and positive epistasis), so long 
as fluctuations are sufficiently rapid. When the htness of genotypes depends upon their 
geographical location, sex may also evolve under the right circumstances to break down 
locally detrimental genetic associations created by migration USES In general though, 
while great strides have been made in our understanding of some of the principal pop¬ 
ulation genetic mechanisms yielding an advantage to sex,^ the present analysis tends 
to require rather specihc conditions for these mechanisms to be of relevance, and does 
not provide a setting which allows one to establish that the benehts conferred outweigh 
the acute costs. 

Our aim here is to establish a setting in which sex is seen to robustly outperform asex, 
even in the absence of epistasis and across a broad spectrum of models. We consider 
a setting in which a population of genotypes evolves over time in a context where 
there are no apriori limits on the number of alleles or their htnesses. Figure 1 shows a 
small cross-section of the results of simulations for models with hnite or inhnite haploid 
populations and where htness contributions from individual genes may be combined 
additively or multiplicatively (further examples are given in Figures 6-10 §4 Extended 
Data). In all cases the sexual population is seen to quickly achieve and maintain higher 
mean htness, indicating that selection will favour genes coding for sexual rather than 
asexual reproduction - the one exception being the boundary case of the multiplicative 
inhnite populations model, in which the sexual and asexual processes remain identical 
in the absence of any initial linkage disequilibrium. 

We then concentrate our mathematical analysis on the inhnite populations additive 
model, since dealing with this case allows us to avoid some of the complexities inherent 
in the hnite population models while illustrating basic principles which carry through 
to the hnite population additive model. We are able to give a mathematical proof that, 
during the process of asexual propagation, a negative linkage disequilibrium will be 
created and maintained, meaning that an occurrence of recombination at any stage of 
the process will cause an immediate increase in htness variance and a corresponding 
increase in the rate of growth in mean htness. In contexts where there is a large but 
hnite bound on allele htnesses, we prove that the sexual population will always be that 
which survives when sexual and asexual populations compete for resources. 

1. The model 

We consider haploid populations with non-overlapping generations. In the absence 
of epistasis between alleles at a single locus, this analysis could easily be extended to 
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consider diploid populations. We describe here the additive inhnite population variants 
of the model (other variants are described in (SI) 15.5). A slightly unusual feature of 
the model is that we do not assume alleles come from a pre-existent pool, but consider a 
(form of random walk mutation) model in which alleles are created by mutation as time 
passes, possibly without any bound on attainable htness. We shall assume that genes 
htnesses take integer values, but one could also consider real valued htnesses without 
substantial changes in behaviour. Most other features of the model, which we now 
describe in more detail, are essentially standard in the literature. 


Each instance of the model is determined by three principal parameters: £, D and 
/x. First, £ G N (> 1) specihes the number of loci. With each individual specihed by 
£ genes, in the absence of epistasis we need only be concerned with the htness values 
corresponding to those genes, and so each individual can be identihed with a tuple 
X = {xi, ..., Xi) G The fitness of x is F{x) = (For the multiplicative model, 

one would dehne F{x) = Y\l=i instead.) Second, the domain D <Z determines 
which individuals are allowed to exist. We will use three types of domains in this 
paper: The H-model uses as domain D = where N = {1, 2, 3,....}; the Z-model uses 
D = {x E Z^ : F{x) > 0}; and the bounded-model uses D = {!,..., NY for some upper 
bound N eN on gene htnesses. In practice there is almost no diherence between the N- 
and Z-models, but there are situations when it is simpler to consider one or the other. 
Third, /r: Z —)■ ]R-°, the mutation probability function, determines how mutation ahects 
the htness of genes: /i(fc) is the probability that the htness of a gene will increase by 
k. For the sake of simplicity we assume this distribution to be identical for all loci. 
While there is no clear canonical choice for fi, the behaviour of the model is robust to 
changes in this parameter so long as negative mutations are more likely than positive 
ones, both being possible. This is because any such choice of /x will approximate a 
Gaussian distribution over multiple generations. The simplest mutation distributions 
one may consider are those taking non-zero values only on {—1,0,1}. Unless stated 
otherwise, it should be assumed that from now on mutations are of this form and that 
/x(0) > /i(—1) > /i(l) (giving a form of stepwise-mutation modeP^^. 

By a population we mean a probability distribution 0: Z^ —)■ M-°, where 4>{x) is the 
proportion of individuals that have ‘genotype’ x E Z^. For a population we shall also 
use X = (Ai,..., Xf^), where the Aj’s take values in Z, to denote a random variable that 
picks an individual with gene htness values Xi,...,X£ according to the distribution given 
by <fi. We let M{<fi) denote the mean fitness of the population <fi, namely E{F{X)). 
It should be assumed throughout that all populations considered have hnite means. 
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variances, and that all cnmulants are finite (as is the case, for example, for distribntions 
(p with finite snpport, i.e. with finitely many x ^1/ snch that 0 (a?) 7 ^ 0 ). 

For a sexnal popnlation, the next generation is obtained by application of three oper¬ 
ations: selection, mntation and recombination. We refer to the consecntive application 
of these operations over mnltiple generations as the sex process. For the asex process, the 
operations applied are selection and mntation, and the recombination phase is omitted. 
With a mnch less significant effect, at the end of each generation we will also apply a 
trnncation operation that erases individnals falling ontside the domain. 

Selection. The probability of snrvival for an individnal is proportional to its fitness 
valne. If cp is the population prior to selection then the resulting population, Sel(0), 
is given by: 

Sel{(p){x) = ^^(p{x), foTxeZ^. 

The factor 1/M{(p) normalises the probability distribution. 

Mutation. Let Ci be i.i.d. random variables taking values in Z with distribution p. If 
we apply mutation to a random variable X = (Xi,..., Xf) we get (Xi + Ci,Xi + Cp). 
Equivalently, if (p is the population prior to mutation then, for x G Z^: 

Mut(0)(a;) = (p{y) ■ y,{y - x), 

y&D 

where y, is the extension of /i to a function on 1/ according to the assumption that 
mutations act independently on distinct loci (i.e., y{ai, ...,ae) = Y\l=i ■ 
Recombination. For the sake of simplicity we assume that the ^ loci are unlinked, so 
that they either correspond to loci on distinct chromosomes (one may consider that we 
are choosing a ‘representative’ from each chromosome), or else he at sufficient distances 
when they share a chromosome. In general the effect of recombination is to leave 
the distributions at individual loci unchanged, while bringing the population towards 
linkage equilibrium. We make the simplifying assumption (for the infinite models) that 
the effect of a single application of recombination is to bring the population immediately 
to linkage equilibrium. (A population is at linkage equilibrium if the random variables 
Xi are independent.) If (pi{x): Z —)■ is the distribution at locus i, (i.e. (pi{x) = 

Yliy&Dyi=x 4>{.v)) then the resulting population is given by: 

i 

Rec((/))(a 3 ) = ]^ 0 i(a;i), for x = {xi, ...,xe) G Zp 

i=l 

Recombination as we consider it here is thus equivalent to multiple applications of 
recombination in its standard form. 
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Mutation and recombination may create individuals that fall outside the domain D. 
At the end of each generation, we therefore perform truncation to remove those outlying 
individuals. Tru(0)(a;) is defined to be 4>{x)/s ii x ^ D, and 0 otherwise, where s is the 
normalising factor s = (Tables 1-3, §4 Extended Data) that 

the proportion of the population moving outside the bounds of D in each generation is 
negligible, and that truncation along the lower bounds will have an insignificant effect 
on the whole process. 


2. Analysing the model 


The objective now is to show that mean fitness increases more rapidly for sexual 
populations (reduction to selection at the gene level can then be achieved in a standard 
fashion, by consideration of the effect of selection on genes which code for sexual rather 
than asexual reproduction). Proofs of all claims in this section appear in (SI). 

Each generation sees two forces acting on the mean fitness M = On the one 

hand, mutation causes a hxed decrease in M by an amount that depends only on fi. 
(Recall that deleterious mutations are more likely than beneficial ones.) Selection, on 
the other hand, can be shown to increase mean htness by V/M (a form of Fisher’s 
‘fundamental theorem^, where V = V{cf)) = Vai{F{X)) is the variance of the fitness 
of (p. Recombination does not affect M directly. Thus, for hxed /i, the increase in mean 
htness at each generation is determined by the variance. The diherence between the sex 
and asex processes will be seen to stem from the ehect of recombination on variance, 
which then results in an increase to the change in mean htness for the sex process during 
the selection phase. 

The ehect of mutation on the variance is a hxed increase at each generation (again 
entirely determined by p). The ehect of selection on variance is given by: 

r(sei(^)) - r(^) = II - (P'' 

where is the third cumulant of F{X). Our hrst theorem shows that for the sex 
process, the ehect of recombination on variance is positive, giving an advantage of sex 
over asex. 


Theorem 1. If <fi* = Sel(0) was obtained by an application of selection to a population 
<f> at linkage equilibrium, then the effect of recombination on fitness variance is given 
by: 

V(Rec(^-)) - 

where V) = Var(0j) and M = M{<fi). This effect is therefore non-negative. 
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This theorem applies to the sex process because a previous application of recombina¬ 
tion would bring the population <p to linkage equilibrium. Linkage equilibrium is then 
preserved by mutation. 

Our second theorem shows that recombination has a positive effect on variance in a 
much more general situation, as for instance, during an asex process where we suddenly 
apply recombination. It establishes that for a population initially at linkage equilibrium, 
any subsequent applications of recombination during later generations always give an 
increase in variance and so a corresponding increase in the rate of change of mean fitness. 

Theorem 2. For the Tj-model, starting with a population at linkage eguilibrium, suppose 
we iterate the operations of mutation, selection and recombination in any order (possibly 
applying only mutation and selection over multiple generations, and of course applying 
truncations when relevant). Then any non-trivial application of recombination has a 
positive effect on variance. 

By a trivial application of recombination we mean one acting on a population which 
is already at linkage equilibrium, and so which has no effect at all. This is the case, 
for instance, if one applies recombination twice in a row: the second application is 
trivial. The theorem is stated only for the Z-model because truncation creates technical 
difficulties when producing a proof for the other models. With the effect of truncation 
being so small, however, the claim of the theorem is, in fact, verified in all simulations 
we have run for any of the models. 

To explain what is behind Theorem we need to introduce two new key terms: the 
linkage disequilibrium term LD 2 and the flat variance. We define LD2{4>) to be the 
decrease in variance produced by recombination: 

LD 2 ( 0 ) = f"( 0 )-f"(Rec( 0 )). 

LD 2 can be shown to be equal to the covariance term E{XiXj) — E{Xi)E{Xj). 
Theorem 1^ states that LD2{4>) is negative at all stages of the process, unless the popu¬ 
lation is at linkage equilibrium, in which case LD2{4>) = 0 . 

A more geometric way of understanding LD 2 is through the notion of flat vari¬ 
ance. Let M = {E{Xi), E{X 2 ), ..., E{Xf)) G this vector represents the aver¬ 
age individual in the population. The global variance of a population is defined as 
GV{(p) = L^(||X — AT|p). Recombination does not affect the global variance, GV{<p), 
at all. However, it changes the shape of the population by increasing the variance in the 
direction that is useful for selection, namely the fitness variance. Consider the diagonal 
line d = {(xi, ...,Xi) E : Xi = X 2 = ■ ■ ■ = Xi} and its {£ — 1)-dimensional orthogonal 
complement P = {(xi, ...,Xi) G : Xi -|- X 2 + • • • + = 0}, and let vr^ and Tip be the 
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projection functions onto d and P respectively. Using that F{X) is the inner product 
of X and (1,1,1), one can show that: 

U(0 )=£-Var(||7 r,(X)||). 

We dehne the flat variance of a population to be the variance of its projection onto P 
multiplied by a correcting factor: 

FV(4>) ^ jP- E(,\\t,p(X - M)f). 

Informally, V{4>) measures how tall a population is along the vector while 

FV{(p) measures how fat it is. The effect of recombination on variance and flat variance 
satishes a simple formula: 

and hence 

LD2 = Pf(v(4,)-FV(4,)y 

Thus, LD 2 being negative is equivalent to FV being greater than V, or, more informally, 
the population being fatter than it is tall along d. The dynamics of this interaction 
are explained in Figure 2, and the effects for unbounded and bounded domains are 
illustrated in Figures 3 and 4 respectively. 

Theorems [T] and show an important advantage that sex has over asex. In compar¬ 
ing sex and asex populations evolving independently, however, these theorems do not 
suffice to entirely specify how the variances of the two populations differ at any given 
generation. To make this comparison we would need to understand the evolution of the 
third cumulant, which behaves differently in each process. The evolution of the third 
cumulant depends on the fourth, which depends on the hfth, and so on. 

Rather than analysing further the evolution of populations over time, we now study 
what happens to the sexual and asexual populations in the long term. We prove that, 
for the bounded model, whatever the initial populations are, sex outperforms asex in 
the long run. 

We state the following theorem in terms of a mixed population containing both sexual 
and asexual individuals competing for resources. Thus the population distribution cp 
now has domain D x {s,a}, the second coordinate indicating whether the individual 
is sexual or asexual. Mutation acts exactly as before among each type of individual. 
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the asexual population. At stage 2000, a single application of recombination is made to the previously 
asexual population, converting that flat variance into variance and greatly increasing the rate of increase 
in mean htness. Without any subsequent applications of recombination, however, the asexual population 
will eventually have smaller variance than the sexual one and will once again fall behind in mean htness. 
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Figure 4. This figure shows the level curves for 2-locus populations 
proceeding according to the bounded model, with maximum allele fitness 
400, mutation rate 0.2, and with the probability any given mutation is 
beneficial being 10“^. All alleles initially have fitness 50. The probability 
density level curves are depicted at stages 500, 1500, 2500, 3500, 4500 and 
5500. We can again observe the increase in fiat variance and decrease in 
variance for the asexual population, and also that the sexual population 
does not necessarily have a higher global variance. 


Selection is also the same, now using M{<p) = J2xeDx{s a.} normalise. 

Recombination acts only among the sexual individuals. 
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Theorem 3. Given and i, for all sufficiently large bounds N and for any initial 
population in which the proportion of sexual individuals is non-zero, the proportion of 
sexual individuals converges to 1 and the proportion of asexual ones converges to 0. 

This is the longest and most complicated proof of the paper: the proof appears 
in (SI j ^5.4[ ). Applying the Perron-Frobenius Theorem suffices to prove that the asex 
distribution converges to a limit, and using other techniques we are also able to find a 
good approximation for the mean fitness at that limit. We do not prove that the sex 
process converges to a limit, but still get a good estimate of the geometrical average of 
the mean fitness over generations. Such ideas would not work for the N- and Z-models as 
in those cases the mean fitness diverges to infinity in both the sex and the asex processes. 
Figure 5 shows the manner in which sexual and asexual populations converge to their 
respective fixed points over time (while we do not prove that convergence to a fixed point 
always occurs for sexual populations, such convergence was observed in all simulations). 

3. Discussion 

In nature one must surely expect a variety of mechanisms to be of significance in deter¬ 
mining the most efficient methods of reproduction. As well as those factors mentioned in 
the introduction, sex may provide advantages for species not subject to random mating 
by strengthening selection,^ for example, or may provide a straightforward advantage 
in providing two parents to care for young offspring.!^ Such arguments, however, do not 
suffice to explain the prevalence of sex in species for which random mating is a good 
approximation or without parental care. Our aim here has been to rigorously establish 
a fundamental and underlying mechanism conferring strong advantages to sex. We have 
seen that independence between loci allows for more rapid growth in mean htness. In 
the absence of such independence, the selection of fitter alleles at a particular locus will 
be stronger when other genes have lower fitness values. A simple analogy may be given 
in terms of the comparative value of improvements to sensory abilities: If an organism 
has little sight, a small improvement in hearing may be more important than it is for 
an organism with excellent vision. Thus, in the asex process, the result is that indi¬ 
viduals which have high fitness on a gene, tend to have low fitness on another - this 
is essentially what negative LD 2 means, and what is behind the proof of Theorem 
The effect of the sex process is to break down these negative associations, but not to 
increase or decrease the global variance of a population. The key role of recombination 
is to transform the variance produced by negative associations - the flat variance - into 
the form of variance which can then be acted upon by selection - the fitness variance. 
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Figure 5. Each of the six plots shows the trajectory of the centre of mass 
for various sexual and asexual 2-locus populations over multiple genera¬ 
tions, for a number of different initial populations and for the bounded 
model. Each point represents the centre of mass of a population at a 
single generation, and the populations were then allowed to evolve for 
sufficiently many generations that an equilibrium point was reached. The 
bottom-left plot shows intermediate steps in the evolution towards the 
middle plot in the bottom row. For that plot, we have 40 different initial 
populations, half sexual (red), half asexual (blue). The bound, N, on 
gene htness is 50 for all plots except for the top-centre and bottom-right, 
where N = 301. The probability of mutation is 0.5 except for the top- 
left plot, where the probability of mutation is 0.9. The probability that 
a mutation is benehcial is 0.001 in all cases. Starting from the top-left 
and moving clockwise, the original populations are Gaussian distributions 
with standard deviations 5, 25, 6, 8, 6 and 6 respectively. 

Of course a natural question, having considered the inhnite populations case, is the 
extent to which this analysis carries over to the hnite populations model. The principal 
difference in moving to hnite populations is that the process is no longer deterministic. 
The equations governing the change in mean htness and variance due to selection and 
mutation for the inhnite population model would now perfectly describe the expected 
ehect of mutation and selection for hnite populations, and the hnite populations model 
could be seen simply as a stochastic approximation to the inhnite case, were it not for the 
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loss in variance and higher cumulants due to sampling (since picking n individuals from 
a distribution with variance v produces a population with expected variance v{n — l)/n). 
For large populations this effect will be very small on a stage by stage basis, and so our 
analysis for inhnite populations can be seen as a good approximation over a number of 
generations which is not too large. Ultimately, however, sampling will have the effect 
that mean htness for the population no longer increases without limit: once variance is 
sufficiently large the expected loss in variance due to sampling balances the increase that 
one would see for an inhnite population with the same cumulants. Larger populations 
are thus able to sustain much higher mean htnesses than small ones. 

While sexual reproduction has been seen here to confer strong advantages in the 
absence of epistasis, i.e. in the setting of simplistic and entirely modular htness land¬ 
scapes, we have said nothing about how this picture changes in the presence of epistasis. 
Assuredly, the task of efficiently navigating htness landscapes (i.e. optimisation) is one 
that, beyond its relevance here, is of fundamental signihcance across large areas of 
applied mathematics and computer science (hence the recent interest of computer sci¬ 
entists in the benehts of sexual reproductiorP^. However large the role of epistasis in 
the biological context, it is certainly true that in most of these applications epistasis 
(in one guise or another) plays a crucial role, and so the interesting question becomes 
that as to whether sexual reproduction continues to oher these substantial benehts in 
the face of more complex htness landscapes. It may be the case that as well as capi¬ 
talising more efficiently on existing modularity, sex plays a fundamental role in finding 
modularity.^ One would expect a proper analysis to require classihcation of htness 
landscapes in terms of their amenability to diherent forms of population based search 
(see, for example, the work of Prugel-BennetP^. 

References 

[1] Kondrashov, A.S. Classification of hypotheses on the advantage of amphimixis. J. Hered. 84: 372- 
387 (1993). 

[2] Maynard-Smith, J. The evolution of sex. Cambridge University Press, Cambridge (1978). 

[3] Felsenstein, J. The evolutionary advantage of recombination. Genetics 78: 737-756 (1974). 

[4] Bernstein, C. & Bernstein, H. Aging, sex and DNA repair. San Diego Academic Press (1991). 

[5] Michod, R.E. Genetic error, sex and diploidy. J. Hered. 84: 360-371. 

[6] Maynard-Smith, J. The evolution of recombination. In The evolution of sex: an examination of 
current ideas, Michod, R.E. & Levin, B.R. eds. Sunderland, Massachusetts, Sinauer: 106-125 (1988). 

[7] Barton, N.H. & Charlesworth, B. Why sex and recombination? Science 281: 1986-1990 (1998). 

[8] Otto, S. P., & Barton, N.H. The evolution of recombination: removing the limits to natural selection. 
Genetics 147:879-906 (1997). 


THE EVOLUTIONARY BENEFITS OF SEXUAL REPRODUCTION 


15 


[9] Barton, N.H., & Otto, S.P. Evolution of recombination due to random drift. Genetics 169:2353-2370 
(2005). 

[10] Hill, W.G., & Robertson, A. The effect of linkage on the limits to artihcial selection. Genetical 
Research 8:269-294 (1966). 

[11] Morgan, T. H. Heredity and sex. Columbia University Press, New York (1913). 

[12] Fisher, R. A. The genetical theory of natural selection. Oxford University Press, Oxford (1930). 

[13] Muller, H. J. Some genetic aspects of sex. American Naturalist 66:118-138 (1932). 

[14] Muller, H.J. The relation of recombination to mutational advance. Mutation Research 1:2-9 (1964). 

[15] Salathe, M., Kouyos, R.D. & Bonhoeffer, S. On the causes of selection for recombination underlying 
the Red Queen hypothesis. American Naturalist 174(suppl.):S31-S42 (2009). 

[16] Peters, A. D., & Lively, C.M. The Red Queen and fluctuating epistasis: a population genetic 
analysis of antagonistic coevolution. American Naturalist 154:393-405 (1999). 

[17] Gandon, S., & Otto, S.P. The evolution of sex and recombination in response to abiotic or coevo¬ 
lutionary fluctuations in epistasis. Genetics 175:1835-1853 (2007). 

[18] Pylkov, K. V., Zhivotovsky, L.A. & Feldman, M.W. Migration versus mutation in the evolution 
of recombination under multi-locus selection. Genetical Research 71:247-256 (1998). 

[19] Lenormand, T., & Otto, S.P. The evolution of recombination in a heterogeneous environment. 
Genetics 156:423-438 (2000). 

[20] Otto, S.P. The evolutionary enigma of sex. American Naturalist, 174: S1-S14 (2009). 

[21] Ohta,T., & Kimura,M. A model of mutation appropriate to estimate the number of electrophoret- 
ically detectable alleles in a finite population. Genet. Res. 22, 201-204, (1973). 

[22] Moran, P.A.P. Global stability of genetic systems governed by mutation and selection. Math. Proc. 
Camb. Phil. Soc., 80, 331-336 (1976). 

[23] Moran, P.A.P. Global stability of genetic systems governed by mutation and selection II, Math. 
Proc. Gamb. Phil. Soc., 81, 435-441 (1977). 

[24] Geodakyan, V.A., The role of sexes in the transmission and transformation of genetic information, 
Probl. Peredachi Inf., 1, 105-112, (1965) (In Russian). 

[25] Lloyd, D.L., Benefits and handicaps of sexual reproduction, Evol Biol, 13, 69-111, (1980). 

[26] Livnat, A., Papadimitriou, G., Dushoff, J., & Feldman, M.W., A mixability theory for the role of 
sex in evolution, PNAS, vol. 105, no. 50, 19803-19808, (2008). 

[27] Livnat, A., Papadimitriou, C., Pippenger, N., & Feldman, M.W., Sex, mixability and modularity, 
PNAS, vol. 107, no. 4, 1452-1457, (2009). 

[28] Priigel-Bennett, A., When a genetic algorithm outperforms hill-climbing. Theoretical Gomputer 
Science, 320, (1), 135-153, (2004). 

[29] Maynard-Smith, Evolution in sexual and asexual populations, American Naturalist, 102, 469-678, 
(1968). 

[30] Grow, J.F., & Kimura, M. The theory of genetic loads, Proc XI Int. Gongr. Genetics., vol 2, 
495-505, (1964). 



16 


ANDREW E.M. LEWIS-PYE^ & ANTONIO MONTALBAN^ 


^ Andrew E.M. Lewis-Pye, Department of Mathematics, London School of Economics, 
London UK. Email: andy@aemlewis.co.uk. 

^ Antonio Montalban, Department of Mathematics, University of California, Berkeley, 
USA. Email; antonio@math.berkeley.edu. 



THE EVOLUTIONARY BENEFITS OF SEXUAL REPRODUCTION 


17 


4. Extended Data 


Figures 6 and 7 display the outcome of simulations for the additive hnite populations 
model. Figures 8 and 9 display the outcome of simulations for the additive inhnite pop¬ 
ulations model. Figure 10 displays the outcome of simulations for the hnite populations 
multiplicative model. Where required for our proofs, we have shown that the propor¬ 
tion of a population at the boundaries will generally be small after sufficiently many 
generations have passed. Tables 1, 2 and 3 show the proportion of the population at 
the boundaries for the additive inhnite populations N-model and also for the bounded 


model. All variants of the model are described in 15.5 





Figure 6. Simulations for the hnite additive model. In these simulations 
the ‘standard’ input parameters were: population size 10000; mutation 
rate 0.1; probability mutation is positive 0.1; 10 loci, initial gene htness 
5. In each graph one parameter is varied, while the other parameters take 
the standard values. 100 simulations were run for each parameter set, 
and the mean htnesses as well as the standard deviations for these mean 
htnesses are depicted, after a number of generations which is sufficient 
for the mean htness to stabilise. This number of generations was taken 
to be 4000, except for the case of varying mutation rate where 20000 
generations were run for each simulation. 
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initial gene htness =1 

10 

20 

0.2 

3.5 X 10-^'^ / 1.0 X 10“^^ 

1.1 X 10-^'^ / 3.2 X 10“^^ 

3 X 10-^« / 5.6 X 10“^^ 

0.1 

1.6 X 10-^^^ / 1.0 X 10“^^ 

3.7 X 10-^1 / 1.5 X 10-^‘J 

8.2 X 10“^^ / 6.6 X 10-4^ 

0.05 

2.9 X 10-^^ / 6.6 X 10-4^ 

5.7 X 10-«^ / 3.6 X 10"^^ 

1.1 X 10-«^ / 2.4 X 10-4« 

0.025 

2.7 X lO-'^^ / 1.2 X 10-4’'' 

4.6 X 10-y^ / 1.7 X 10-^y 

1.0 X 10-^^ / 9.4 X 10-^^ 

0.0125 

9.8 X lO-^'J^ / 1.5 X 10-4^ 

1.5 X 10-^^^ / 4.2 X 10“^^ 

4.1 X 10“^^^ / 8.0 X 10-^*^ 


Table 1. The table concerns the inhnite additive N-model, and shows 
the proportion of a 2-locus population which has fitness 1 at either locus 
for sex/asex, after 1000 generations, for varying initial gene htnesses, 
and for varying mutation rates. In all cases the probability that a given 
mutation is benehcial is 10“^. 



N = 200 

300 

400 

0.2 

3.4 X 10-^'^ / 3.8 X lO-^y 

1.5 X 10-«^ / 2.1 X 10-'^^ 

6.6 X 10“^^^ / 1.2 X 10“^'^ 

0.1 

3.5 X lO-i'J'^ / 5.0 X 10“^^ 

3.8 X lO-i^*^ / 2.1 X 10-^4^ 

3.9 X 10-^*^“ / 8.9 X lO-^'^'J 

0.05 

1.4 X 10-^^^ / 1.1 X 10-^4'^ 

7.5 X 10“^^^ / 1.8 X 10-^^^ 

3.9 X 10-^*^^ / 2.8 X lO-^*^^ 


Table 2. The table concerns the inhnite additive bounded model, and 
shows the proportion of a 2-locus population which has htness 1 at either 
locus for sex/asex, after 25000 generations, for varying N (maximum allele 
htness), and for varying mutation rates. In all cases the probability that 
a given mutation is benehcial is 10“^ and all alleles initially have htness 
50. 



N = 200 

300 

400 

0.2 

6.5 X lO-^*^ / 1.2 X lO-^'J 

3.4 X 10“^^ / 9.3 X 10-4^ 

2.6 X 10“^^ / 1.1 X 10-^« 

0.1 

9.2 X 10“^*^ / 2.5 X 10“^^^ 

7.1 X 10“^^ / 1.1 X 10“^^ 

7.5 X 10“^^ / 6.3 X 10“^^ 

0.05 

1.5 X 10“^ / 1.0 X 10“^ 

2.3 X 10"^^ / 1.3 X 10"^^ 

4.5 X 10“^^ / 2.2 X 10-^*^ 


Table 3. The table concerns the inhnite additive bounded model, and 
shows the proportion of a 2-locus population which has htness N (max¬ 
imum allele htness) at either locus for sex/asex, after 25000 generations, 
for varying N and for varying mutation rates. In all cases the probability 
that a given mutation is benehcial is 10“^ and all alleles initially have 
htness 50. 
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Population Size /1000 Mutation rate 






Probability mutation is beneficial 



Mutation rate 



Probability mutation is beneficial 


Figure 7. These graphs display variance and LD^ for the same simula¬ 
tions which have their mean htnesses displayed in Figure 6. 
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Figure 8. Simulations for the infinite additive model appear to show 
V/M reaching a limit value over time. The figure shows approximate val¬ 
ues for these limits, for sex (red) and asex (blue). In all these simulations 
the probability that a given mutation is beneficial was fixed at 0.1, and 
gene fitnesses were initially 5 (although the latter parameter has no effect 
on the limit values found). 














sex — asex 
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g is the probability a given mutation is beneficial. 
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— sex 
asex 


40 



10 IS 20 25 

Number of loci 


Figure 10. Simulations for the finite multiplicative model. In these 
simulations the ‘standard’ input parameters were: population size 10000; 
mutation rate 0.1; probability mutation is positive 0.1; 10 loci, initial 
gene htness 1. In each graph one parameter is varied, while the other 
parameters take the standard values. 100 simulations were run for each 
parameter set, and the logarithms (base 10) of the mean htnesses are 
depicted after 500 generations (without any suggestion that the mean 
htness has stabilised by this point). 
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5. Supplementary Information 


In this section we provide a much deeper analysis of all the claims made in the 


main article. We start, in Subsection 5.1, by proving Theorem 1 and showing how the 
various operations affect different features of a population, such as mean htness and 

is dedicated to the proof of Theorem Although Theorem 


variance. Subsection 5.3 


[T] and Theorem are both concerned with establishing negative values for LD 2 , their 
proofs are completely different and give us alternative ways of understanding the model. 
Theorem is proved in Subsection 5^ The proof of this theorem is much longer than 
those of the previous theorems, and, again, it is very different in style. Once we have 
proved our main theorems, we move on to discuss variants of our model in Subsection 
m The variants we consider are the hnite version and the multiplicative version. 
We do not have a full mathematical analysis for those models, but present the results 
of simulations. The outcomes of simulations are presented in the previous section. 
Extended Data lEl 


5.1. The evolution of the key values. In this subsection we review some well known 
facts and describe in more detail how mutation, selection and recombination affect mean 
htness, variance, LD 2 and hat variance, proving the claims made in Section The 
objective is to establish all of the results in the table below: 


Ehect of: 

selection 

mutation 

recombination 

AM 

VfM 


0 

AV 

A 3 /M - (E/M)" 

t Var(p) 

-LD 2 

ALD 2 

- E«, V.Vi/AV w 

0 

-LD 2 

Ak3 

{V/M){{kJV) - 3 (n 3 /M) + 2{y/MY) 

i KfilJ,) 

-IDs 


Table 4. By AM is meant the change in M produced by the relevant 
operation. All values (M, V, etc) inside the table are with respect to 
the population before the relevant operation is applied: the box stating 
that AM for selection is V/M should be read M(Sel(0)) — M{<p) = 
V{(p)/M{(p). (*) The stated ehect of selection on LD 2 is only valid in the 
case that selection is acting on a population at linkage equilibrium. 


5.1.1. The evolution of mean fitness and variance. The impact of mutation on the mean, 
variance and all cumulants is simply described (recall that mean htness and variance 
are the hrst and second cumulants of F{X)). If Y and C are independent random 
variables and is the nth cumulant, then k„(T + C) = Kn(Y) + K,n{C). Thus the ehect 
of mutation on the mean htness is to increase it by (which will be negative given 

our assumptions on p). Similarly, the ehect on variance is to increase it by f'Var(/i). 
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The effect of selection is given by the following lemma (while these claims are either 
well known or easily established, we include a proof for the sake of completeness). Here 
M = M{<p), V = V{<p), As = k, 3 {<P), A 4 = k, 4 {<P) and M*, V*, Ag are the corresponding 
values for (p* = Sel(0). 

Lemma 4. The effect of selection on mean fitness, variance and A 3 is given by: 

M* -M = V/M, 

V*-V = K3fM-(VfMf, 

k;-K3 = (H/M)((a4/H)-3(a3/M)+ 2(H/M)2). 

Proof. We prove the first two identities. The third then follows with a little more 
algebraic manipulation, by almost identical methods. In order to see the hrst identity, 
note that: 

M* = F{x) Sel{(P){x) = F{xf(p{x). 

X X 

Now, using that the second moment about the origin, F{xY<p{x), is equal to V +M^ 
we get: 

X 

This gives the well known identity V/M = M* — M, as required. In order to derive the 
second identity, we recall the formula for the third central moment: 


Then: 


1^3 = '^{F{x) - Mf(f){x) 

X 


= Y^{F{xf - 3 F{xYM + 3F{x)M^ - M^) (f){x) 

X 



= ^ F{xfcj){x)^ - 3MV - M^. 

X 


V* 


(^F(xf c/)*(x)^ - {M*f = - {M*f. 


Substituting V/M + M for M*, we get: 

y* - y = + V^/M^) - V = nf’/M - V^/M"^, 

X 


as required. 


□ 
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Let us now consider recombination. Recall that Xj is a random variable taking values 
according to the distribution (specifying the distribution at the ith locus). At any 
point, the mean is given by M{(f)) = Since recombination has no effect on 

each (pi, it also has no impact on M{cp). The change of variance due to recombination 
is —LD 2 , by the dehnition of LD 2 . 

5.1.2. Recombination, LD 2 and flat variance. It is not the case, however, that the sum 
of variances is the variance of the sum, unless the random variables in question are 
independent. Since Rec{(p) is at linkage equilibrium, and recombination leaves each (pi 
unchanged, we conclude that R(Rec(0)) = JfiVi, where Vi = 'Va,T{(pi). We therefore 
have: 

£ 

ld2 = v-J2^^- 

i=l 

There is a second way to calculate LD 2 that will be useful later. The variance can 
be expressed: 

V(<t>) = EpY^xV) - 

i=l i=l 

= E (^( V") - EiX,)p + ^ [e(X,X,) - EiX>,EiX ,)). 

i=l 

£ 

= ( E L) + E {e(X,X,) - E(X,)E(Xi )). 

i=l i^j 

This gives a description of LD 2 as a sum of covariance terms: 

LD^ = E {e{X,X,) - E(X,)E(X,)). 

Vj 

Letting M = E{X 2 ),E{Xi)) G recall that the global variance GV = 

GV{<p) was dehned as E{\\X — and the flat variance EV = EV{<p) was dehned as 

E{\\ttp{X — M)\\'^)i/{i — 1). Using that 7rp(X) + 7rrf(X) =X and that, by Pythagoras, 
||7rp(X - M)||2 + IMX - M)f = ||X - M\\f we get: 

£ 

GV = J2y^ = E{\\X-Mf) = 

i=l 

B(|kp(A' - M)f) + E(\MX - M)f) = 'X^FV + jV. 

Thus, since ^ unaffected by recombination, so is — 1)EV + V)/ We can 
also deduce that if (p* is at linkage equilibrium and V* = then EV* = V*. 
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It follows that the effect of recombination on V and FV is to make them equal while 
leaving ((£—l)FV+V)/£ unchanged, thus making them both equal to ((£—l)FV+V)/£. 
We then have: 

V(Rec(^))-V(^) = ^^(FV-V) and FV(Rec(^)) - FV(^) = j(V - FV), 
and 

LB2 = ((£-1)/£)(V-FV). 

5.1.3. The evolution of LD 2 - The most direct way in which recombination affects mean 
htness is by changing the variance, which then affects the growth in mean htness via 
selection. The change in variance due to recombination is given by —LD 2 . Thus, to 
show that recombination has a positive effect on variance, one must show that LD 2 
is negative. In this subsection we analyse the effect on LD 2 given by the different 
operations. As part of our analysis we get a proof of Theorem 

Since LD 2 = 0 when at linkage equilibrium, we have LD 2 {Rec{<p)) = 0. 

Mutation has no effect at all on LD 2 as shown by the following lemma. 

Lemma 5. For any population <f), LD 2 {Vi'u.t{(f))) = LD2{4>). 

Proof. Recall the dehnition of mutation in terms of the random variables Cj. 
LZl2(Mut(0)) = (e((W + Ci){Xj + Cj)) - E{Xi + Ci)E{X^ + C',)) 

= {e{X,Xj) + E{XiC,) + E{aXj) + E{QC,) 

-E{Xi)E{X,) - E{Xi)E{Cj) - E{Ci)E{Xj) - E(C',)E(C',)). 

Since Ci and Cj are independent, and are independent of X* and Xj, most of these 
terms cancel, leaving E{XiXj) — E{Xi)E{Xj) as required. □ 

Let us take this opportunity to note that mutation has no effect at all on linkage 
equilibrium: This is because if the variables Xi are independent, so are the variables 
Xi + Ci- Also, since FV = V — {£/{£— 1))LD2, we conclude that the effect of mutation 
on flat variance is the same as that on variance: FV{Mut{(f>)) — FV{4>) = £Var(/i). 

The effect of selection on LD 2 is more complex and is given by Theorem (restated 
below) in the case that the operation is applied to a population at linkage equilibrium. 
The rest of the subsection is dedicated to proving it. We define LD^ to be the decrease 
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in the third cumulant of F{X) produced by recombination. Thus, 

e 

LD3(0) = «3(i"(X))-5^A3(X,). 

i=l 

As with the other values, we use % to denote and to denote ^(Xj). 

Theorem 1. If <p* = Sel(0) was obtained by an application of selection to a population 
4> at linkage equilibrium, then the effect of recombination on variance is given by: 

l/(Rec(<A*)) - V{4>-) = 

where Vi = Var((/)j) and M = M{<p). 

Theorem asserts, in other words, that LD 2 {Sel{(f>)) = —ifffi^jViVfj/M'^ if 0 is 
at linkage equilibrium. The key to the proof is to study the effect of selection on 
each locus separately, as given by the following lemma. Let us describe our notation. 
Let (f>* = Sel(0). Recall that we use a boldface greek character, 0, to denote the 
distribution of a population in and the lightface version of that character, cfi to 
denote the distribution at the ith. locus. We denote the mean fitness at locus i by 
Wi = E{(j)i). By the linearity of expectation we have M = to 

denote the mean fitness of the loci other than i, i.e., Wi = M — Wi. Use use U to denote 
the variance in fitness at the ith locus: U = Var(0j). The notation is analogous for (f)*\ 
w; = F(f*), u; = Var(0:), etc. 

Lemma 6. If selection acts on a population at linkage equilibrium, the effect on the ith 
locus is given by: 

4>*i{x) = ^ (x + W,^ (j)i{x). 

Proof. First, let us observe that E{F{X) \ Xi = x) = x + Wp 

i 

E{F{X) \Xi = x) = '^E{XfXi = x) = x + ^E{Xf = x + 

i=i jA* jA* 

For X G Z and y G let x '"iy be the vector of length I with x as the i-coordinate 
and with all other coordinates given by y in corresponding order. Second, we calculate 
0:(x): 

= ( 1 /^) F{x^iy)ct){x^iy) 

yeZ<’-l 

= (1/M) (j)i{x) E{F{X) I W = x). 


Putting these equations together, we get the result of the lemma. 


□ 
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The next lemma shows the effect of selection on the htness and variance at a single 
locus. 


Lemma 7. If selection acts on a population at linkage equilibrium, the effect on fitness 
and variance at locus i is given by: 


Wf-W, 
V* - V, 


Yl 

M’ 

M \m) 


Proof. For the first equation: 

W* = ^x4>*{x) 

X 

= (1/M) Y,x{xPWi)UY 

X 

= (hm) (j)i{x) + Wi x(f)i 


X) 


= (1/M) (^(V, + W^) + W,W, 

= (l/M) {V, + MWfi. 

This establishes the first equation of the lemma. 

For the second equation, let Vi be the second moment about the origin of fi, that is, 
^x'^fifix), and analogously for fi*. Let be the third moment about the origin 
of fii, that is, ks^i = Then 

x'^dfix) 


V = ... 

X 

= (1/M) y^^x^ (x + Wi) fiijx) 

X 

= o/M) (+ 

\ X X 


x'^fifix) 


= (1/M) {ks,, + W,V 
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Now, using the developments of the moments about the origin in terms of the central 
moments we get: 

V* = y* = V* 

^3,i = ^3,i + SViWi + Wf , 

+ 


The equation above then becomes 


+ n? + 2^!h + = ^ + 3V,'^ + ^ ^ Yi + W'P, 


M 

which we can re-arrange as 


M 


v;+\2V^-W^-V^ 


M \M \ M M ^ 


-V, and that ( ^ + f - IT,^ ) = 0. 


To finish the proof of the lemma one only has to observe that ( 214 ^ — 314-^ “ ~ 


■Wi 


■Wi 


□ 


We now continue with the proof of Theorem]^ Using that LD 2 = V — get: 


LD^m - LD^i,!,) = (V -V)-{Y,V; -V,) 



LD, , (£.L^)-V 

M M2 


M2 

The last equality follows since LD^ = 0 for a population at linkage equilibrium. 

5.2. The ordering on distribntions. This subsection is dedicated to proving some 
basic combinatorial lemmas which are required for the proof of Theorem Our new 
key notion is the ordering ^ among probability distributions on Z, which will be useful 
throughout the rest of the paper. We made the assumption earlier that all cumulants of 
populations are hnite. It is similarly to be assumed that all cumulants of distributions 
discussed in this section are finite. 

Definition 8. Given two distributions ipi and 'ip 2 '- ^ we define: 


4>2 ^ V’l 


(V61 < 62 e Z) 4 ’i(&i)4’2(&2) < 4’i(&2)4’2(&i)- 
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We let -02 if, in addition, there exist bi < b 2 E Z with '^i(&i)V’ 2 (^ 2 ) < '0i(^2)'02(&i)- 


To give some intuition for the meaning of let us remark that if ifi and '^2 are 
non-zero on an interval [A,B], and zero elsewhere, then: 


7p2 ^ i/ji 


(V6 G Z with A < b < B) 


V’2(& + 1) ^ 'ipi{b+ 1) 
i’2{b) ~ iJi{b) 


If 'ip 2 A this gives a lot of information about the supports of V’l and V ’2 (i-e. those 
X for which 7 ^ 0 or ' 02 (a;) 7 ^ 0). If x is in the support of 0i, then for any y > x in 

the support of 02, U must also be in the support of 0i. Similarly, if x is in the support 
of 02, then for any y < x in the support of 0i, y must also be in the support of 02- We 
can therefore find disjoint (possibly empty) sets 111,112 and Ila such that the support 
of 02 is Ill U 112 , the support of 0i is 112 U IIs, and all the elements of IIi are below all 
the elements of 112 which are all below all the elements of Ila. 

The main three properties of the ordering ^ are that it is preserved by mutation, it 
is preserved by selection, and it preserves the ordering of expected values. The proof of 
Theorem in the next section will use all of these lemmas to show that LD 2 becomes 
and remains negative during an asex process initially at linkage equilibrium. 


Lemma 9. The orderings -< and ^ are preserved by mutation. That is: 

02 ^ 01 ^ Mut(02) 0 Mut(0i). 

The same holds for -<. Here Mut refers to the mutation operation for i = 1. 


Proof. We must show that for any values b 2 > bp. 


(1) E 02(d)/r(62 - c?) ■ E V'i(c)/x(&i - c) < ^ 0i(d)/i(62 - c?) ' E '*^2(c)/i(6i - c). 

d c d c 

The r.h.s. can be re-expressed: 

^ (^0i(c)02(c)/i(62 - c)/i(6i - c)) + ^ ((0i(ci)02(c)/i(62 “ d)/i(&i - c) 

c d>c 

-F 0 i(c) 02 (d)/i (62 - c)y{bi - d))^. 

The l.h.s. is: 

(02(c)0i(c)/i(62 - c)/r(6i - c) + ((02(d)0i(c)/i(62 - d)/i(6i - c) 

c d>c 

-F02(c)0i(d)/i(62 - c)/i(6i - d)) j. 
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For any given pair [d, c) such that d > c dehne: 

ai = '0i(c)V’2(d), 02 = '0i((i)V’2(c), 

/ 3 i = ii{b2 - c)/i(6i - d), [52 = /i(&2 - d)^{bi - c). 

Now for any values d > c we have 02 > ai because "02 ^ t/’i- We claim that we also 
have [32 > [di. this holds because in order to have / 3 i > 0 one requires 62 < c + 1 
and d < 61 + 1, which can only be the case if 61 + 1 = c + 1 = 62 = d. In that case 
/ 3 i = /i(l)/i(—1) and [32 = /i(0)/i(0), and it follows that [32 > [ 3 i from our assumption 
that /r( 0 ) > /i(— 1 ) > Thus: 

0:2/32 + Oi/3i > 01/32 + 02/3 i. 

This establishes the inequality Q. 

Now suppose that 1P2 and let hi < 62 be such that V’i(fei)'?/’2(&2) < V’i(&2)t/^2(&i)- 

Consider again the expansions of the l.h.s. and r.h.s. of Q. Since we have already 
shown that each term on the r.h.s. is greater than or equal to the corresponding term 
on the left, we need only identify one term on the right which is strictly greater than 
the corresponding term on the left. The reasoning above already suffices to give this 
strict inequality for the case c = 61, d = 62, since then 0:2 > Oi and [32 = /u(0)^ > A- ^ 

The next lemma shows that ^ is also preserved by selection. In fact we shall prove a 
stronger result. For £ = 1 and IF G M, we dehne a new form of selection, which, as we 
saw in Lemma allows us to understand the effect of selection on a single locus under 
certain conditions. For 0 a probability distribution on Z and x G Z, we dehne 



where s is the normalising factor required to make Selii/(0) a probability distribution: 
s = + W) 0 (x) = E{([)) + W. We call a probability distribution on Z non-trivial 

if its support consists of more than one point. 

Lemma 10 . If Wi < IF2 and 'ip2 ^ then Selw2{'f’2) ^ Furthermore, 

if Wi < W2, 1/^2 ^ 'f’l and at least one of ipi and 'ip2 is non-trivial, then Selw2{'f’2) -< 

Selvi/i (i/’i). 

Proof. Let = Selvyi('^i), let = Selvi/2('02) and consider hi < A- On the one side 
we have 


A (^1)^2 (A) 


1 


{bi + Wi){b2 + W2)Mbi)Mb2), 


S1S2 
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which we need to show is less than or equal to 

V’i(&2)'02(^l) ~ -(^2 + Wi){hi + W2)'lpl{h2)'lp2{bi), 

SIS2 

where si and S2 are the normalising factors for 'ipi and 'ip2- We know that '0i(^i)'02(&2) < 
'0i(&2)'02(&i), so it is enough to show that (61 + Wi)(62 + < (&2 + Wi)(6i + W2). For 

this, one just needs to observe that: 

(62 + hFi)(6i + W2) - (61 + Wi)(62 + W2) = (62 - h){W2 - Wi) > 0. 

Note that this actually suffices to show (61 + Wi){b2 + hF2) < (&2 + hFi)(6i + hF2) if 
Wi < W2. 

Now suppose that we also have hFi < hF2. The reasoning above actually suffices to 
show for all pairs bi < 62 that V’i(^i)l/^2(^2) < '0i(^2)t/’2(^i)) t/’i(^2)t/’2(&i) > 

0 . If at least one of fji and '^2 is non-trivial then there exists a pair bi < 62 with 
'0i(&2)'02(&i) > 0 , giving Selpv'2('02) -< Selwi(' 0 i) as required. □ 

Lemma 11 . //'02 ^ i/’i? then E{pp2) < Efipi). Furthermore, i/V’2 V’l then E{'ip2) < 


Proof. The proof is divided into various cases depending on the supports of V’l and ^’2- 
Let Hi, 112 and fls be as defined subsequent to Definition]^ 

Case 1 : The support of both fji and -02 is a finite interval [A,B] (so 111 = 113 = 0 
and 112 = [A,B]). This is the simplest case, but gives the principal idea for the entire 
proof. We will define probability density functions (pi for i G [A,B], with ■02 = Fa ^ 
Fa+1 ^ ^ = - 01 , and E{ipi) < E{ipi+i) for all i G [A, B). Each ipi will satisfy: 


(V 6 G [A,i)) 


Fiip + 1 ) 


'ifiib + l) 

fJiip) 


and 


(VfeG [*,S)) 


Fiip + 1 ) 


02(fe + 1) 
02 (&) 


Suppose we have already defined Fi and we want to define Fi+i- We need to change 
the value of from to without changing any of the other fractions. 

For this, we need to find values c, d such that defining ipi^i{b) = c Fi{b) ior b < i and 
ipi+i{b) = d Flip) for 6 > i gives the required probability density function. To find such 
c and d all one needs to do is to solve the following equation: 


c^ + d(l-^) = 1 

d 0l(i)02(* + 1 ) = c 0 i(z + 1)02(0^ 

where S = EhnV’.O)' Since we have c < 1 < d, and if SfUl < 

then c < 1 < d. Since we are increasing the values of Fi{^) b > i and 
decreasing them for 6 < i, it is not hard to see that E{ipi) < E{(pi^i), and that if 
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1 then E{(pi) < E{(pi+i). This hnishes the construction of the cpiS and 
the proof for the case where the support of 'ipi and "02 is [A,B]. 

Case 2: The support of 0 i and 02 is not an interval, but it still hnite and equal for 
both functions. The proof above works almost the same way, except that one has to 
skip the values not in the support. 

Case 3 : The supports of 0 i and 02 are equal, but while 112 is bounded below it is 
not bounded above. One runs the same proof, but now constructs an inhnite sequence 
02 = A ‘^A+i A ■■■ ■ Ultimately 0 i is the limit of this sequence, i.e. for all b, 
01 ( 6 ) = \im.iLpi{h). Since we have assumed that 0 i and 02 have finite means, it follows 
that U(0i) = \iTsii^^E{p)i). 

Case 4 : The supports of 0 i and 02 are equal, but while 112 is bounded above it is 
not bounded below. One runs the same proof, but now constructs an infinite sequence 
02 = 9^5 ^ A ■ ■ ■ 1 such that E{ipi) < E{ipi_i) for all i < B. Again we have 0 i as 
the limit of this sequence and E{'ipi) = limj_^oo E{p>i). 

Case 5 : The supports of 0 i and 02 are equal, and 112 neither bounded above nor 
bounded below. One runs almost the same proof, but now in two stages. Choosing 
A G n2, we first construct an inhnite sequence 02 = A P^A+i A ■ ■ ■ which has the 
intermediate value 03 as limit. One then constructs an inhnite sequence 03 = (p'j^ ^ 
(P'a_i a ■ ■ ■ with 01 as limit. 

Case 6 : At least one of fli or 113 is non-empty. If 112 is empty then it immediately 
follows that U( 02 ) < U( 0 i), so suppose this does not hold. Let 01 be the probability 
density function formed from 0 i by restricting the support to 112 (and normalising as 
appropriate), and form 02 similarly. If fli is non-empty then we have: 

U(02) < U(0*) < U(010 < U(0i). 

If n 3 is non-empty then we have: 

U(02) < U(0;) < U(0t) < U(0i). 

□ 

5 . 3 . The properties (f) and (ft). This section is dedicated to proving Theorem]^ 
which asserts that LD2 stays negative throughout the process, independent of what 
operations are applied and in which order, except when recombination has just been 
applied in which case LD2 = 0 . This is for the Z-model, and assuming truncation is 
applied after each application of mutation and recombination (or at least before any 
application of selection). 
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We previously described a distribution on Z as non-trivial if there exists more than 
more point in the support. We shall refer to a population 4 > (on 1 / for £ > 1 ) as non¬ 
trivial if at least two of the marginal distributions 0* are non-trivial (and providing this 
remains the case when truncation is applied to 0). 

The key idea behind the proof of Theorem is to consider a property, (f), which 
suffices to ensure that LD2 is non-positive. We will also consider a strengthening of 
(t), which we call (ff) and which ensures that LD2 is negative. To prove Theorem]^ 
we then use induction on the generations and show that the property (f) is satisfied at 
each step of the process, and that, in fact, the stronger property (ff) is also satisfied 
from some early stage onwards (the first stage after which Sel is applied to a non-trivial 
population). 

Definition 12 . Given a probability distribution 0 : —)■ ]R-° and a eTj, we shall say 
that 0 “ is defined if fi{a) = 7^ 0 . In this case 0 “ is the distribution given by 

-0“(6) = 0(0, h)/ip{a). 

We say 0 satisfies (f) if for every oi < 02 such that and are defined, 0 “^ ^ 

. We say that satisfies (ff) if, in addition, there exist Oi < 02 such that 0 “! and 
(xj-Q defined and 0“^ -< 0“! as witnessed by a pair hi < 62 with Oi -|- &i > 0. 


To define (f) for a population with £ > 2, we need to consider each locus compared to 

the rest of the loci altogether. For i 7^ j G {1, ■■■,!}, let FfiX) = Xi-\ -|-Wj_i-|-Xj+i-|- 

■ ■ ■ + X(, = F{X) — Xi. Given a population <p, let be the distribution corresponding 
to the random variable {Xi, Fi{X)). Equivalently: 

0i(a,6)= 4){x). 

x&lh 

Xi=a, F{x)=a+b 

Definition 13. A population satisfies (f) iffii does for every i = 1, ...,£. A population 
(f) satisfies (ff) if there exists i such that fit satisfies (ff). 

The next step is to prove that (f) and (ff) are preserved through the operations. 
Recall that we are assuming the process starts at linkage equilibrium. Note that if 4> 
is at linkage equilibrium, then (f) holds - in that case we have equality between the 
left-hand side and the right-hand side in the definition of the ^ relation. 


Lemma 14 . Fori > 2 , if (fi satisfies (f), then Sel( 0 ) satisfies (f), and if 4 > is non¬ 
trivial then Sel( 0 ) satisfies (ff). 


Proof. Let <fi* = Sel{<f>). Fix i G A similar argument to that of Lemma 

shows that fi*{a, h) = ( 1 /M) (a -t- h) fifia, h). Thus, ( 0 *)“ = Sela(( 0 j)“). Suppose ai < 02 
are such that ( 0 *)“^ and ( 0 *)“^ are both defined. It follows from Lemma 10 that, since 
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^ (0i)“S 'WG have (0^)“^ ^ If (p is non-trivial it also follows from Lemma 


as required. □ 

Lemma 15. Both (f) and (ff) are preserved by mutation. 

Proof. First, let us note that mutation can be broken down into a number of consecutive 
steps, by treating one locus at a time. Let Mut^ be the application of mutation on the 
/cth locus, i.e., Mut^(Xi,..., X*,,..., = (Xi,..., X^ + Cfc,..., X^). We will show that 

both (t) and (ff) are preserved by each of the operations Mut^. Let cp* = Mut*^(0). Fix 
i. Assuming (f) or (ff) for pi, we establish that the same condition holds for p*. 

Suppose hrst that k ^ i. Then for a E Z, = Mut((0j)“), because the mutation 

happens in one of the loci included in the second coordinate, and has the same effect on 
Fj(X). Consider oi < 02 such that and ( 0 *)“^ are both dehned. It then follows 

from Lemma that since (0*)“^ ^ {PiT^ we have (0*)“^ ^ and hence that p* 

satisfies (f). We get (ff) similarly. 

If fc = i, then the proof is the same. One just needs to observe that -0: —)■ ]R-° 

satisfies (f) (or (ff)) if and only if p'{a,b) = p{h,a) does. □ 

So far both lemmas hold for any of the models. The following lemma only holds for 
the Z-model. 


10 


that (0*)“^ -< (0*)“L Thus p* satisfies (f) and also satisfies (ff) if p is non-trivial. 


Lemma 16. Both (f) and (ff) are preserved by truncation for the Z-model. 


Proof. Let p* be the population which results from an application of truncation to p. 
Let s = where D is as in the Z-model. Note that 0*(a, 6) = 0 if a -|- 6 < 0 

and Pi (a, b) = ppa, b)/ s if a b > 0. 

Fix i, ai < 02 and bi < 62- Then if 0 *(ai, 61) 7^ 0 we have ai-\-bi > 0 , and hence both 
oi -I- 62 and 02 + are positive. Therefore, if (0j)“0&2)(0i)“^(&i) > (0i)“^(&2)(0i)“0&i) 
then (0*)“0&2)(0i)“^(^i) > (0i)“^(^2)(0i)“^(&i)- In the same way if (ff) holds because 
(0i)“i(62)(0i)“^(&i) > (0i)“^(&2)(0i)“^(&i) holds and Oi -|-61 > 0 (the latter condition be¬ 


ing required by Definition 12), then this implies (0*)“n^2)(0i)“^(&i) > (0*)“^(^2)(0*)“H&u 
Thus both (t) and (ff) are preserved, as required. □ 

The second part of the proof of Theorem is to show the connection between the 
properties (f), (ff) and LD 2 . Recall that Co(X, X) is the covariance of the random 
variables X and X, i.e., Co(X, X) = E{XY) — E{X)E{Y). Using our calculations from 


.1.2 we obtain that: 


i=l 
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Lemma 17. If ip-.l? ^ M-° satisfies (f), then Co{(p) < 0. Furthermore, if (p satisfies 
(ft) then Co(ip) < 0. 

Proof. Let X,Y be random variables such that {X,Y) has probability distribution (p. 
The key idea is to use that (f) implies that E{Y\X = a) is decreasing in a, which follows 
from Lemma [H 

Set V = E{X). Now we would like to put u = E{Y\X = v), but since we may have 
V ^ Tj this presents a slight difficulty. If u ^ Z, we let u be a number in between 
E{Y\X = [nj) and E{Y\X = [n]). Now let IT = X - n and Z = F - u. Since v and u 
are constants we have: 


Co(X, Y) = Co(IT, Z) = EiWZ) - E{W)E{Z). 

Let '0(IT, Z) specify the distribution on the pair (IT, Z). Notice that since ip satishes 
(t), so does ip. Let 'pw and 'pz specify the corresponding marginal distributions, and let 
‘ipz{Z\W) specify the conditional distribution. We claim that E{WZ) is non-positive. 
This is because: 


EE 'ijj{a,b)-ab = ^ ^ = y^afjw{ 0 ')E{Z\W = a). 


Now satisfaction of (f) and Lemma 11 imply that when a is positive E{Z\W = a) is 
non-positive, and when a is negative E{Z\W = a) is non-negative. Also E(W)E{Z) = 0 
because E{W) = 0. 

If (p satisfies (ff) then E{Z\W = a) is non-zero somewhere, and hence E(WZ) < 0, 
as needed to get Co(X, T) < 0. □ 


Corollary 18. If 4> satisfies (f), then LD 2 {<p) < 0. Furthermore, if 0 satisfies (ff) 
then LD 2 { 4 >) < 0. 


To finish the proof of Theorem all we need to observe is that since the population 


starts at linkage equilibrium, it initially satisfies (f). By Lemmas 14, 15 and the 
condition (f) is then satisfied throughout the process. Furthermore, the condition (ff) is 
satisfied after an application of Sel to any non-trivial population (mutation will quickly 
produce non-trivial populations), and then remains satished until any such point as Rec 
is applied. Since satisfaction of (ff) ensures negative LD 2 , any non-trivial application 
of Rec therefore increases variance. 


5.4. The bounded model. This section provides a deep analysis of the asymptotic 
behavior of the bounded model and is dedicated to giving the full proof of Theorem 
1^ Recall that in the bounded models the fitness values of the genes are restricted to 
{!,..., N}, which we denote [1, N], and the domain of the process is D = [1, NY, which 
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we sometimes denote N^. Also recall our assumption that our mutation distributions 
only take non-zero values on —1, 0 and 1; we let a = /i(—1), b = ju(0) and c = ju(l). 
Even though that assumption seems to be inessential in simulations, it is important 
for the type of formal analysis we do in this section. Recall also that we assume that 
c < a < b. 

Let us restate TheoremIn the statement of the theorem, we consider the case where 
both types of individuals, sexual and asexual, live together. Mutation and selection act 
the same way on both, but recombination acts only among the sexual individuals. 
Imagine, for instance, that at a certain time there exists a purely asexual population, 
and that by random chance a few of the individuals in the population then become 
sexual and start reproducing among themselves. The theorem states that over time the 
proportion of the population which is sexual will then tend to 1. 

Theorem 3. For every a, b, c with c < a, for every £ > 1, and for all sufficiently large 
N (i.e. there exists Nq such that WN > Nq), whatever the initial population is, so long 
as the proportion of sexual individuals is non-zero, we have that the proportion of sexual 
individuals converges to 1 and the proportion of asexual ones converges to 0. 


Let us fix the values of a, b, c and i throughout the rest of Section 5.4 


The proof of Theorem proceeds by showing that the mean htness of the sexual 
population is eventually higher than that of the asexual population, over a time average. 
We will be able to show that the asexual population in isolation converges to a limit 
distribution, and we will provide an upper bound for the mean fitness of that limit. 
While we strongly suspect that the sexual population in isolation also converges to a 
limit (as evidenced by simulations), we have not been able to prove it. Nevertheless, 
we can still provide a lower bound for the geometric mean of the mean fitness over 
generations, which is larger than the upper bound for asex. We will show this is enough 
to establish that sex outperforms asex. To obtain these upper and lower bounds, the 
key technique is to study the case when positive and negative mutations have the same 
probabilities. This case is much easier to analyse, and we then find a way of translating 
those results to the case we are interested in, where downward mutations are more 
likely. An issue that we have to be constantly aware of is how truncation affects the 
populations. 

Let us start by showing that we can analyse the sexual and asexual populations sep¬ 
arately, as we were doing earlier in the paper. Let cff G be the probability 

distribution for the entire population at stage t (no confusion should result from any 
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conflict in notation with that given in Dehnition[^- the latter notation may be consid¬ 
ered only to apply to the proof of Theorem . We can split 0* into two fnnctions 
and 0asex) each with domain D. Let and p>isex be the proportions of individnals 
which are sexnal and asexnal respectively, i.e., = \4>sex\ v^asex = I^LexI) where 

01 is the taxicab norm-. For 0 G 


iV’i = \'^{.x)\. 

x^D 


So far, we have always assnmed that populations, 0, are probability distributions, and 
hence that |0| = 1. This is not the case with and 0asex! we will not be making 
that assumption anymore. If a probability distribution is really required, all we have to 
do is consider normalisation: 


Proj(0) 


iv^r 


From now on when we refer to the operations of truncation and selection, we will omit 
the normalisation: Thus, when we apply truncation, giving Trunc(0), we just erase the 
individuals outside the domain, and applying selection, giving Sel(0), simply involves 
multiplying 0(a;) by F{x). We do not alter the dehnition of Mut as this operation 
already preserves the norm. The dehnition of Rec is now altered in the obvious manner, 
so that it remains norm preserving (requiring division by |0|^“^). Since the normal¬ 
ization operation, Proj(0), commutes with all the other operations, it does not really 
matter when we apply it. The advantage of describing the same process in this new 
fashion, is that now mutation, truncation and selection act independently for the sexual 
and asexual populations, and it is only normalisation that involves interaction between 
them. Of the operations selection, recombination, mutation and truncation, only trun¬ 
cation and selection affect the values and 9?asex- Since recombination and mutation 
commute, we can and will assume in what follows that the operations cycle through in 
that order. In fact, we will assume that is obtained from <p^ by applying: mutation, 
truncation, selection and recombination in that order (recombination, of course, being 
applied for sex only). For each t, let: 


ALx = 


|Rec(Sel(Trunc(Mut(0*^J)))| 


10; 


t I 
sex I 


For the quotients, we have: 

y^asex 


n-1 \t-l 
rsex '^sex 

>nt-l \t-l 
r asex '^asex 


^ |Sel(Trunc(Mut(0‘33j))| 


\t _ 
'^asex 


10: 


t I 
asex I 


y^Lx 


n 


i=0 ^sex 


T asex 


n 


i=0 ^asex 


and 
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To establish Theorem it will be enough to show that for ah sufficiently large N: 


lim 

t 


n 


i=0 ^sex 


n 


* 1 yi 
i=0 asex 


+ 00 . 


We may therefore consider the populations separately, because all we need be concerned 
with are the values n!: ii^iex and nt; ALex, which evolve independently. We will 
establish the limit above, by showing that the geometric mean of is eventually 
always greater than that of A^^^^ by at least a hxed margin. 

Let us hx the notation for dealing with geometric means. Given a hnite sequence 
ai,...,an of numbers, we let GMt<„({at}) = Oj. Given an inhnite sequence 

we let 


GM(at) = lim sup 


\ n®*’ 

\ *=i 


GM fa^) = lim inf 


\ n®*’ 

\ *=i 


and if both limits are the same we call this common value GMi(at). 

The rest of this subsection is dedicated to proving the following theorems: 


Theorem 19. For every N, the limit of exists and, for r = b + 2y/ac: 


lim Ai,„„ < Nir^. 


t^OO 


Using facts observed from simulations, we are conhdent in claiming that in actual fact 
hniAr_).oo linp^oo K^sex/^ — AVe will not need this extra fact, however, and the result 
of the theorem will be enough for our purposes. 


Theorem 20. Let t = b + 2^/ac as above. For all sufficiently large N: 

gm(alj/ > Ntr . 

t^OO 

Using facts observed from simulations, we are conhdent in claiming that in actual fact 
hniAr_s.oo linp^oo Kex/^ = — 1)), which is greater than ir^ for £ > 1 and 

r < 1. Once again, we will not need this extra fact, however, and the result of the 
theorem will be enough for our purposes. 

It then follows from the theorems above that GMf^oo(ALx/ALex) > 1 for any large 
enough N, and hence that hmt(]^*~Q A 3 g,j.)/(n*~Q ^asex) = +oo required. 

5.4.1. Understanding A^g^, and X\sex- general, given a population t/? G we dehne 

^ |(Sel(Trunc(Mut(V;))))| 


Then A^^,^ = X{<Psex)y similarly for asex. 
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we use M{xl)) to denote its 


Given a population i/? G , let be the proportion of individuals surviving 

mutation followed by truncation, i.e.: 

^ |Trunc(Mnt(V>))| 

IV’I 

Let us remark that < 1. Given a population G we use to denote its 

mean htness, even in the case that i/? is not normalised: 

/ N cMe F(x)tb(x) 

Mi'll)) = \ 

m 

The increase in norm caused by an application of selection (ignoring normalization) is 
given by the mean htness: 

Let us remark that M{il)) < Ni because F{x) < Ni for every x G D. 

Since mutation and recombination do not affect the norms, we have: 


M(V>) = 


= Mi'll)). 


X{'il)) = pi'll)) Mi'll)'), 


where ip' = Trunc(Mut('i/>)). 


5.4.2. Cha'ngmg the parameters. A key idea here is to use the case when positive and 
negative mutations are equiprobable to get information about the case we are interested 
in, where c < a. In this subsection we show how we can change the mutation parameters 
from a, b, c to a', b', d satisfying a' = c', in a manner which allows us to translate from 
one process to the other in a controlled way. 

We dehne a', b', d so that they satisfy the following equations: 

a' + &' + h = 1, = , and d = d. 

vac Va'c' 


The reason we require b/^/ac = b'jy/d'd will become clear later. These equations are 
enough to determine the values of a', b' and d as follows. Since a' = c' = (1 — b')/2, for 


T = b + 2y/ac we get: 

b 

y 

(2) 

y/ac 

\iy-y) 

( 3 ) 

6(1 - b') 

= 2y y/ac 

( 4 ) 

b 

\/ac b 

= b'ib + 2\/~(xd) 

( 5 ) 

^fdd ~ y 

= r. 
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So b' = b/r and a' = c' = ^/^/T. 

To better visualize the translation from the a6c-process to the a'b'c'-process, let us 
start by considering the case I = \ first. Consider the diagonal square matrix C of size 
N X N given by: 

C{x, x) = . 

Our goal now is to show that applying the afec-process to a population (p is equivalent 
to applying the a'fe'c'-process to C ■ 0 up to a factor of r. In other words, we will show 
that T ■ Sel(Muta'a'(^ ' 0)) = ^ ' Sel(Mutac(0)). 

Let Mutac be the matrix corresponding to an application of mutation with probabilities 
/i(—1) = a, /i(0) = b and /i(l) = c, followed by truncation (but without normalisation). 
That is: 


Mutac = 


1 ^ 

a 

0 

0 


0 

0 \ 

c 

b 

a 

0 


0 

0 

0 

c 

b 

a 


0 

0 

0 

0 

c 

b 



0 

0 

0 

0 



b 

a 

0 

0 

0 

0 


c 

b / 


From now on, we will assume truncation is part of mutation, and mutation refers to 
multiplication by Mutac. 

Lemma 21. In the case I, = 1: 

T ■ Muta'a' = C ■ Mutac ' C~^ 

Proof. We carry out the matrix multiplications: 
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V : ; ; ; y 

The last equality uses that ^a'd = a! = c' and that = p- D 

Notice that, in the 1-locus case, selection without normalization is given by a diagonal 
matrix Sel where Sel(i,i) = i. Since diagonal matrices commute, we have: 

r • Sel ■ Muta'a' ■ C = C ■ Sel ■ Mutac- 

The case when £ > 1 is not overly different, but the notation is now a little more 
cumbersome. Consider the diagonal square matrix C of size x given by 

where a_i, oq, ai are a, b, c respectively. 

Let us use Mutac to denote the matrix corresponding to a6c-mutation with i genes. We 
should actually denote this matrix by Mutac,£, 7 V 5 but since there is no risk of confusion 
we prefer to simplify the notation. 


Lemma 22. For i > 1: 

■ Muta'a' = C ■ Mutac ' C~^ 

Proof. Consider xjj G . Then, for x = (xi,..., a;^), 

and 

t't( 

2i=—li2=—1 i£=—l \ 

In the equation above, assume that if (xi — ii,X 2 — * 2 , — ig) ^ D, then '^(ti — 

ii,X 2 — ^ 2 , — if) = 0. Replacing each oq by OqT, each a_i by aflT^/aZfJa[ and 

each ai by a'lT^/af^JaZ[ we get 

il=—1 *2=—1 ii=—l \i=l 

= rVa-i/ai 

il=-l*2=—1 *^=—1 \i=l 


^ 'if>{xi - ii,X2 - i2, - it) 


'if>{xi-ii,X2-i2, ...,Xi-ii)\/ 


T,j ^j+J2i * 


j o 


Mutac-C b^(a;) = ^ 


Y\ai. I ij{xi-ii,X2-i2, ...,xe-ie)^/a-i/ai 


' E ^ j '^j 


i=i 
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□ 


/ 

b 

a 

0 

0 

.. \ 


2c 

2b 

2a 

0 



0 

3c 

3b 

3a 



0 

0 
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5.4.3. The fixed point for Asex. In this subsection we prove Theorem 19 which states 
that the limit of less than or equal to Nir^. The proof has two steps. First we 

show that if the asex process reaches a fixed point t/?, then A('i/?) < Nir^. Second, we 
show that, independent of the starting point, the asex population always converges to 
a hxed point. 

As we mentioned before, mutation (which we now consider to incorporate truncation) 
acts on a population (considered as a vector) by multiplying this vector by the matrix 
We use SelMut„f. to denote the matrix Sel • In the case £ = 1 we have 


SelMutn^ = 


The following lemma shows how useful is the translation developed in the previous 
subsection. 

Lemma 23. Suppose that i/’ac £ a a fixed point for the asex process. Then X{xl)< 
Nirfi 

Proof. Let i/? = fi^aciN- That t/? is a fixed point for the asex process means i/? = 
Proj (SelMutac-lA); or equivalently that ^j^ is an eigenvector for SelMutac with eigenvalue 
X{'ip), i.e., SelMutac ■ = X{'ijj)'ip. From Lemma 22 we have that ■ SelMuta/Q/ ■ C = 

C ■ SelMutac- It follows that ■j? = C ■ i/’ is an eigenvector of SelMuta/Q/ with eigenvalue 
r“^A(i/’)- Thus 

/A(i9) = A(V>), 

where A('i9) is calculated using the mutation = a', /^'(O) = b', /i'(l) = ah (We 

should use the notation Xa'afi'^) and Xafifi’) to specify the mutation used, but it will be 
clear from context which dehnition we are using.) Since A('i9) = p('j9)M(Muta/a/-'j9) < N^, 
we have A('i/j) < as required. □ 


Theorem 19 now follows from the following lemma. 


Lemma 24. For every a, b, c, i, N, there is a unique ^ 

non-negative, non-zero (j) G .■ 




such that for any 


/imt^oo(ProjSelMut„c)* ■ 0 = 
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Proof. We apply the Perron-Frobenius theorem, which states that a non-negative, ir¬ 
reducible and primitive matrix has a positive (real) eigenvalue A whose absolute value 
is larger that that of any other eigenvalue, and that A has a unique (up to scaling) 
associated eigenvector all whose coordinates are positive. The matrix SelMutac is non¬ 
negative, in the sense that all its entries are non-negative. It is also irreducible and 
primitive because all the entries of (SelMutac)^ are positive. So we can apply the 
Perron-Frobenius theorem and get a positive eigenvector i/? = 1 /?^^in ^ which is a 
probability distribution with a positive eigenvalue A that is the largest in absolute value. 
As a corollary of the Perron-Frobenius theorem we also get that lim^^oo (SelMutac)V-^* 
is the projection to the eigenspace given by ip, and that this projection is non-zero 
for any non-zero non-negative initial population. This implies that t/? is a universal 
attractor of the system defined by iterating SelMutac and normalisation.(For a similar 
application of the Perron-Frobenius Theorem in the previous literature, but which does 
not make use of the techniques established here to provide estimates for the mean of 
the resulting fixed point, seel^l^ □ 


Before we move on to consider the asymptotic behaviour for the sex process, we 
need to form a stronger version of Theorem 19 for the 1-locus case (where the sex 
and asex processes are identical). While we shall not establish for general i that 
liniAr_s.oo linit^oo K.sex/^ = shall now do so for the case i = 1 (since we shall 

later be able to apply this result in analysing the sex process). 


Lemma 25. Let 'ipac^N be the probability distribution which is the fixed point of the 
1-locus process, and let ’da’a\N = PEoj(C' • 'ipac,N)- Then: 

(1) lim^a'a', n)/N = 1. 

(2) limN^ocP{^a'a',N) = 1 - 

(3) limN^ooM{nVLta'a' ' ^a'a',N)/N = 1. 


Proof. Since we consider a, b and c to be fixed, let V’n = 'f’ac,N and Pn = 'da'a'.v- We shall 
establish (1) and (2), and then (3) follows immediately from the definition of A('dAr). 
The key to understanding 'dTv is to calculate the following quotients. For k < N, define: 


r]N{k) 


'^Nik -\- 1 ) 

'dnik) 


Let Aat = A('dAr). Since Pn is a fixed point we have that 'dAf(l) = {b'pN{l)-\-a''dN{2))/Xj^ 
and 'dAr(iV) = (c''dAr(iV — 1) + b''dpf{N))N/X]s[. It follows that: 


^n(I) = ^ and r]NiN - 1) = —- 

a' Xn/N — a 
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For X ^ we have = [d'd^q^x — 1) + + a'd^^x + 1 ))x/Xn- Using 

that disfix + 1) = riN{x)'dN{x), we get: 

c'r]N{x — 1)“^ + dr]N{x) = \n/x — h'. 

Now suppose that (1) does not hold. In this case there exists an inhnite set 11 C M, 
such that liniArgn Xn/N = k < 1 (note that Xn < N). For each a; G hJ, dehne: 

R(x) = lini riNiXl — x)~^. 

Wen 

From the formulas for rj{N — x) above (and using that a' = c'), we deduce that R 
satishes the following inductive dehnition: 

Ril) = and Rik + 1) = - Rik)-^. 

a' a' 

All values of R are non-negative, because so are the corresponding values of r]]^{k). 
Notice that R{2) < i?(l), and that R{k) < R{k — 1) implies R{k -|- 1) < R{k), from 
which we may conclude that R is decreasing. R must then have a limit, a say. This 
limit must satisfy a + a~^ = (a — b')/a'. Since for every a G a -f a~^ > 2, 
2 < (k — h')/a'. From the fact that h' = 1 — 2a', it follows that k > 1, which gives the 
required contradiction. 

In order to establish (2), we show hrst of all that hmAr^oo'dA(l) = 0. This now follows 
easily, however, from the fact that lirnTY^-oo-^Ar = oo and = {X^ — b')/a'. 

The hnal step is to show that = 0. Once again, consider the sequence 

R{x) as dehned above. We have: 

1-6' 

/?(1) = —;— = 2 and R{x -|- 1) = 2 — R{x)~^. 
a' 

We conclude that R{x) > 1 for all x. From this it follows that for each x and all 
sufficiently large N, — x) < 1. This suffices to ensure that lim 7 v^oo'dv(-N) = 0, as 

required. □ 


Lemma 26. Let 'ipac,N be the probability distribution which is the fixed point of the 
1-locus process. Then: 

(1) limN^ocX{fiac,N)/N = T. 

(2) limN^ocp{f’ac,N) = 1 - 

(3) /mAr^ooiU(Mutac • fjac,N)/N = T. 


Proof. Again, let V’a = fi’ac^N and let = ’da'a',N be as dehned in the statement of 
Lemma 25 Given Lemma 25, and the fact that rA('d 7 v) = -^(t/’v) (as established in the 
proof of Lemma 23), it suffices to establish (2). That limAr^oot/’iv(-^) = 0, follows from 
the corresponding fact for 'djv, however, since V’at -< '^at. It remains then, to show that 
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liniAr^oo'0N(l) 
k < N, define: 


0. We use a similar method to the proof of Lemma 25 


r]N{k) 


'il^Njk + 1 ) 
i>N{k) 


Now let Aat = A('07v)- We have that: 


This time for 


,,(i) = LlT:, 

a 

The result then follows from the fact that limAr_>.ooAAr = oo. 


□ 


5.4.4. The asymptotic behavior for sex. The rest of Section |5.4| is dedicated 
ing Theorem 20, which gives a lower bound for the geometric mean of 
sufficiently large N: 


to prov- 
For all 


^(ALJ > Nir^. 

t^OO 


The proof requires a sequence of lemmas, some of which we will state now and prove in 
later subsections. Before describing the general architecture of the argument, let us con¬ 
sider how to analyse the £-locus sex process by looking at the different loci individually. 
The reason we can do this is that, since the sex population stays at linkage equilib¬ 
rium, its probability distribution is determined by the product of the distributions of 
the individual loci. 


Let t/j G be a population where all the loci are independent, as for instance after 
an application of recombination. Assume ^J^ has been normalised. Let xjji G be the 
probability distribution for the ith locus. We would like to analyse the a6c-sex-process 
on i/? by analysing its process on -0*. It is not hard to see that if a population is at 
linkage equilibrium then the effect of mutation and truncation on the whole population 
is equivalent to considering the effect of mutation on each single locus independently 
as we did in the proof of Lemma 15 Let = Mutac,i • t/’i, where Mutac,i is the 1- 
locus mutation, and pi = pippi) = |V’i|/|'0i|- ■ 'ip, then we have 

that tp'{x) = Y[l=i'P'i^i) p{jp) = Y[l=iPi'4’i)- Finally, we let W = Mipp'f) and 
Wi = recall that = ^i + Wi. From Lemma we 

have that the effect of selection on a single locus is given by ip* = Sel^p, ■ tp[, where 
Selx is the diagonal matrix with Selx(j, j) = (j + K)/M. Since this matrix actually 
depends also on M, from now on, to simplify the notation we let 


Seli^(j,j) =j + K 
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and leave the normalisation for later if necessary. The increase in norm prodnced by 
Selx applied to V’ ^ is then + K. Hence for = Mutac,i • V’i we have: 




l^il 




The next step is to describe how we are going to nse the translation to the a' = d 
case for sex popnlations. Consider the following setting. Let be a seqnence of 

real nnmbers in [0, (£ — l)iV] (which will later represent the seqnence LL* for some fixed 
i). Let G (later this will represent the initial sexnal popnlation at some locns), 
and let = C ■ Assnme that is not the zero vector. For every t eN, define: 


-0*+^ = Seli^t ■ Mutac,i ■ and = Select ■ Muta/a/^i ■ 

From Lemma 21 we have = C ■ p)^ for every t. Define: 


\t — 
\ — 


\p) 


4+11 


I# 


= p{P;^)-{M{{P;^y) + K 


and = 


|#+^| 


= p(i90-(M((i97) + iF'), 


where (pp^)' = Mutac,i ■ p^^ and (#)' = Muta/a/^i ■ #. 

If -0* was a fixed point, then nsing that r* ■ 'd* = C ■ "0* we conld conclnde that is 
also a fixed point (all given the appropriate normalisations), and that = rA^ as in 
the proof of Lemma 23 Even withont assnming that the process converges to a limit 
distribntion, we still get that these valnes have the same geometric means: 


Lemma 27. GMt^oo(AyA(^) = r. 


Proof. For every k, we have that: 

+^119^1 \C ■ p)^\ 


\p)'^\ |C'-Proj(V>^)| 


I'd'^l \C ■ p)^\ |V’°| IC ■ Proj('0°)| 

The set {0 G (M-^)^ : \p)\ = 1} is compact and hence the image of the continnons map 
pj ^ \C ■ Proj('^)| is a closed interval of the form [a, a(5] for 0 < a and 1 < /d (we get 
that a > 0 becanse \C ■ Proj('0))| is always positive). We then have: 


GIVKAP = 



< GM(Ay 


I# 


t<k 


Symmetrically GMt<fc(A^) > rf/ GMt<A;(A(^). The lemma then follows from the fact 
that both p /and pfd converge to 1 as fc —)■ cxd. □ 


The next step is to give an approximate calcnlation for A^, which holds irrespective of 
the choice for p)^. The next lemma shows that A(^ is eventnally always close io N + K^. 
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The reason this holds is that p{'d^) is eventually always close to 1, and M{{'d^y) is 
eventually always close to N because positive mutations are as likely as negative ones. 


Lemma 28. For any choice of non-negative -0° G and let be 

defined as above. There is a sequence {eArjiveN converging to 0 such that, for every N, 
every sequence and every non-negative 0°, the following holds for all sufficiently 

large t: 


1 — Cn < 



N + K* 


< 1 . 


The proof of this lemma is a little technical, so we delay it until Subsection 5.4.5 The 
following is a small lemma concerning geometric means, which will allow us to compare 
the geometric means of and K^. 


Lemma 29. Let {a''}t<k be a sequence of positive real numbers and let b be a positive 
number. Then b + GMt^k(a^) < + a*). 


Proof. This is a corollary of Jensen’s inequality which states that ip{k ^ 7i) < 

k~^ Y^i=i Pili) ioY (p: M —)■ M convex. One has to apply it to the values = log(a*) and 
the function ^p{x) = log(6 + e^) which is convex because = be^/{b + >0. □ 


For a given choice of 0°, we can use what we have so far to get a lower bound for 
GM(p(0*)). For all sufficiently large k\ 


( 6 ) 

( 7 ) 

( 8 ) 


GMp(0') = GM 

t<k 


\t 


t<k 

> (^iwr__ 

- M((00') + K* 

N + K^ 


> r GM 


\t<k M((00') + 


(1 — Cn) 


The value inside the large parentheses is greater than 1 for large N and sufficiently 
large t because there exists a sequence {eAr} 7 veN with limit 0 such that, for large t, 
M{{y)^y) < tN + Cat < iV, as proved in the next lemma. The lemma also shows that, 
for large N and for sufficiently large t, the value of 0* at the upper boundary N is very 
close to 0. 


Lemma 30. For any choice of G — {0} and let {0*}teN be defined as 

above, and let (0*)' = Mut^ci • 0*- There exists a sequence {eArjA^eN converging to 0 
such that, for every N, every sequence every 0°, the following holds for 

all sufficiently large t: 

M((0*)') < riV + Cat and 0*(A^)/|0*| < cat. 
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Proof. Let 0° G be the probability distribution with (jr{N) = 1. For each t dehne 
= SeloMutac,! • 0^ By induction on t and using Lemmas and 10 we conclude 
that ^ (jf and ^ (f/)' for all t. Using Lemma 11 we then get that < 

Applying Lemma 26, we conclude that there exists a sequence {eArjArgpj 
with limit 0 such that M((0*)') (and so also M{{ip^y)) remains below tN + for all 
sufficiently large t. The second claim of the lemma also follows from Lemma and the 
fact that V’* A 0*- n 


For the last stretch of the proof we need to be more concrete about the sexual popu¬ 
lation we are analysing. Let G be the initial sexual population, and be 

the sequence obtained by iterating ac-mutation, selection and recombination. For each 
locus i and generation t, let ipl be the distribution at locus i at stage t, but ignoring 
normalisation. We use the same notation we have been using so far: 

• (V’*)' = Mut„c,i • f’h 

• {'ipy = Mut^c ■ 

. w/ = Mmyy 
. M^ = M{{^py) = zLwt, 

.wi = Mi- wi- 

• 'ipl'^^ = Sel^t ■ Mutaci • 'ipj] 

• Pi^p^) = \i^py\m = uU pm- 

The objective now is to show that the geometric mean of A('i/?*) = p{ip^)M^ = 
(11^=1 is above Nir^. We will apply the results we have obtained thus far 

for = Wy In order to be able to do this, however, we need to be able to compare 
IF/ and ML If all loci were identical, we would have IF/ = M*(f' — l)/i. When the loci 
are not identical, the following lemma gives us an approximation to W + IF/, and tells 
us that it is close - at least in geometric mean - to W + M*(L — l)/i, just as it would 
be if the loci were identical. 


Lemma 31. There is a sequence {cat : N G N} converging to 0 such that for every N 
and every initial population 'ip^ G as above, the following holds for all sufficiently 
large k: 


1 - ejv < GM 

t<k 


N + Wf 


N + My{i-i)/i) 


< 1 + Cn- 


We will prove this lemma in Subsection 5.4.6 Lemmas ^ and all assert the 
existence of certain sequences {eArjArgN with limit 0. We now let {eArjArgpj be a sequence 
with limit 0, which majorises each of the sequences provided by these lemmas. 









50 


ANDREW E.M. LEWIS-PYE^ & ANTONIO MONTALBAN^ 


We are now ready to finish the proof of Theorem Let k be large. We start by 
cleaning up equation 


using what we now know from Lemmas 29 and|31| Fix i < i. 
N + Wf 


GMp > r 

t<k * “ 


> r 


= T 


> T 


GM 

t<k wi + wj 

GM ^ 

t<k 


GM 

t<k 


“r+ 




= P (1 + 6/^) (1 - eiv)^ 

where = Gy\.t<k{(-N/M^) — 1. Notice that Gy\.t<k{(-N/M^) > 1. Furthermore, by 


Lemma 30, < £(riV + e^) for sufficiently large t. Adjusting the sequence cn as 

necessary (but maintaining the fact that it has limit 0), we then have that for sufficiently 
large k, ^k > ((1 - r)/T) - e^. 


Finally, 


( 9 ) 

( 10 ) 
( 11 ) 


GM(A 

t<k 


* ) 
sexy 


> 


GM(AC)n9M(l-‘) 

Ni 


i=l 


1 + 'Cfe 


(r (1 + ^k/^) (1 - (^n^Y 


3£ 


1 + 

for all large enough k. The last observation to make is that there exists e > 0, indepen¬ 
dent of N and fc, such that the factor 

is greater than 1 -|- e for large N and sufficiently large k. To see this, note that the 
function x h->• (1 -1- x/iY/{l + x) is always greater than 1 for x > 0 and tends to -|-cx) 
as X —>■ -|-cx). (It is actually increasing for x > 0.) Let N* be large enough that 
Cat < (1 — r)/r for all N > N*. Among all the x’s with x > ((1 — r)/r) — cat*, there is 
a minimum possible value for (1 -f x/f')^/(l -f- x), call it (, which is greater than 1. Let 
e be such that C, = 1 + 2e. Then for N > N* for which cat is sufficiently small, we have 
j (1 ~ > 1 -|- e for all large enough k. 


It remains to prove Lemmas 28 and 31 
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5.4.5. The proof of Lemma 28. Roughly speaking, we need to show that gets close 
to N + as t becomes large. Recall that + K^), and that ^(19*) = 

1 — a'd*(l)/|'d*| — c#(A^)/|#|. The proof will have three parts which are: showing that 
M(('d*)') gets close to N, showing that 'd*(l)/|'d*| gets close to 0, and showing that 
'd*(A^)/|'d*| gets close to 0. We remark that it is not surprising that M('d*) gets close to 
N: since positive and negative mutations are equiprobable, mutation without truncation 
does not bring the mean htness down, while selection only ever increases mean htness. 
Therefore the mean htness can be expected to rise, this rise being halted only by effect 
of truncation at the upper boundary. 

The hrst idea for the proof is to consider an alternative population which evolves 
according to a different sequence one that is constant, and that is either always 

larger or else always smaller than the original one. The fact that the sequence is constant 
allows us to apply the Perron-Frobenius theorem and establish a limit population, which 
we can later analyse. Choosing a sequence iC* with larger (smaller) values will guarantee 
that the new sequence is -<-below (-above) the original. This allows us to compare the 
mean htnesses of the two populations, as well as their values at the boundaries 1 and 
N. 

Let us begin with the analysis of the limit populations. Fix a value of K, for which 
we will later substitute either 0 or (£ — l)iV. Dehne 


= m and 


Lt+l 


= Sel;^Mut„v,i ■ 0 . 

Since (jf is dehned by iterating a linear system which is non-negative, primitive and 
irreducible, we can apply the Perron-Frobenius theorem, exactly as we did in Lemma 
to deduce that the populations cjf must converge to a limit population which is 
independent of the starting population (and depends only on N, K and a'). In order to 
analyse we proceed much as in the proof of Lemma 2L. Once again, the key idea is 
to consider quotients between consecutive values in the distribution. For k < N, dehne: 

0Ar(fc -f 1) 


T]N{k) = 


4>N{k) 


Let Xn = A(0Ar) = p(0Ar)(M((0Ar)')-FiF). Since 0Ar(l) = (6'0Y(l) + a'0A(2))(l-hiP)/AAr 
and 0Ar(iV) = (c'0Ar(iV — 1) + b'(j)]\[{N)){N + K)/Xn we have: 

Aa,/(1 + K)-b' 


Vn{1) = 


and riN{N — 1) = 


XN/{N + K)-y' 

For X 0 {1, we have 4>n{x) = (c'07v(a; — 1) + b'(j)N{x) + + 1))(t + K)/Xisf. 

Since (fwix -|- 1) = ?7Ar(a;)0Ar(a;) this gives: 

c'r]N{x - 1)“^ -F ar]N{x) = Xn/{x + K) - b'. 
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Let US now move into the proof that the mean htness grows close to N. Consider 
K = (£ — l)iV, and define (j)^ as above for that K. Since (where K* is the 

sequence given in the statement of the lemma), it follows by induction using Lemmas 
1^ and 10 that for every f, 0* -< This means that 0*(1)/|0*| > 'd*(l)/|'d*|, and (by 
Lemma 11) that M(0*) < M('d*). In order to establish that limjv Xn/{^N) = 1, suppose 
otherwise. Then there must exist an infinite set If, such that lim 7 ven-^A/(^-^) = k < 1 
(note that Aat < £N). For each x G N, define: 


R(x) = lim riNiN — x) 

Wen 

From the formulas for ri{N — x) above (and using that a' = c'), it follows that each R{x) 
is defined and satisfies the following inductive definition: 

i?(l) =- and R{k + 1) =- R(k)~^. 

a! a! 

All the values of R are non-negative, because so are the corresponding values of r]j^{k). 
Note that Ri2) < i?(l), and that R{k) < R{k — 1) implies R{k -|- 1) < R{k), from 
which we conclude that R is decreasing. R must then have a limit, a say. This limit 
must satisfy a -f a~^ = (a — b')/a'. Since for every a G M"*", a + a~^ > 2, we have 
that 2 < (k — h')/a'. Since h' = 1 — 2a', it follows that k > 1, which gives the required 
contradiction. 

So far we have concluded that /N = 1. Since Aat = + 

{£ — l)iV) and p{(f>N) < 1, it follows that limAr_).ooAf(0'Ar)/iV = 1. For now, let = 
1 — M{(p'j^)/N. Since (0')' ^ (i?*)', we also know that liminf* M(('d')')/At > 1 — eN- 
The second step is to show that 'd*(l)/|'d*| is small for large t. Since 0' ^ i?', we know 
that 0'(1)/|0*| > #(1)/1#I, so it is enough to show that once normalised 0v(l) is small 
for large N. This time we define: 


R{k) = lim 

N^OO 


We have that: 


^ y— b_ ^ 

a' 


i/{i -1) - y 

a' 


R{k)-^. 


Since {£/{£ — 1) — b')/a' > {1 — b')/a' = 2 it follows inductively that R{k) > 1 for all k. 
This means that for every k, there exists N large enough such that Pn'{x) > 1 for all 
N' > N and all x < k. Redefine cat to be the maximum between the value ejsf specified 
in the above and 1/k for the largest k such that Pn{x) > 1 for all x < k. It follows that 
for that for all N, 0Ar(l) < and that the sequence en converges to 0. 

The third step is to consider d'(iV). This time we set K = 0 and consider the new 
corresponding sequence 0', with the new limit 0Ar. We now have that i?* ^ 0*, and 
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hence that '(9*(iV)/|#| < 0*(iV)/|0*|. This time, for each x we dehne: 


Rix) = lim r]]\f(N — x) 
Aen 


-1 


By the same argument as above we get that limjv^ooAA/ {N + K) = 1, and in this case 
this means that limAr_).ooAAr/iV = 1. R now satishes: 




b' 


= 2 and R{x + 1) = 2 — R{x 


1-1 


Again we have that R{k) > 1 for all k, which means that for sufficiently large N, 
— x) < 1 for all x < k. We can therefore redehne so that this sequence still 
converges to 0 and: 

p(#)(M((#)') + 


lim inf 

t 


N + K^ 


> 1 — 


as 


needed for Lemma |28l 


5.4.6. The proof of Lemma 31 In this section we prove the last lemma required to 


complete the proof of Theorem 20 Roughly speaking, Lemma 31 asserts that the 


various Wfs (for varying i) eventually stay relatively close to each other, even if they 
are initially quite different. In simulations we have observed that in fact all of the V/fs 
converge to the same value M/i (see for instance Figure 5), but this seems to be hard 
to prove. Instead, we prove that N + LF* becomes close to + M^{i ~ 1)/^ in geometric 
mean, which is enough for our purposes. 

Let us begin by looking at a 2-locus ac-sex process where selection acts with an 
additive value at stage t. More formally, let {K^ : t G a;} be a sequence of numbers 


in [0, {i — 2)N], let Uo,u° 
those two loci, and dehne: 


t,N 


be the initial distributions corresponding to each of 


= SelM((„‘)')+A‘llu'tac,i • Uq and = SelA^((„t)/)+xtMutac,i • u^. 

Notice how the the M-value used in dehning selection at a given locus is the mean 
corresponding to the other locus (as it should be for the 2-locus sex process). 


Lemma 32. There is a sequence {5* : t E u} such that each 6^ > 1, with GMt^oo < 
1 -f Cat and such that for all t: 

y ^ N + M[{v\)’) + K‘ ^ 

6‘ - N + M{(v‘„)') + K‘ - ' 

Proof. Let 05 G be new initial populations, such that 0o is the probability 
distribution with 0o(l) = 1 and 05 is the probability distribution with 05(A^) = 1- Let 
the 0Q and 05 processes evolve as follows: 

00 +^ = SelA^((^t)/)+x‘Mut„c,i ■ 00 and 05+^ = SelM((,,i‘)')+A*Mutac,i ■ 05- 
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Consider the translations of 0o and 01 to the a'a'-process: i.e., let 'dl. = t ■ 0q and 


■di = r ■ (f)\. From Lemmas 27 and 28 we get that: 


and 


(-N < 


1 — eN < 


GM(p(0*)(M((0*y) + M((0*y) 




rGM(Ar + M((0‘)') + 770 
GM(p(0*)(M((00') + M((0*)') + 77* 


< 1 + ejv, 


< 1 + eAT. 


rGM(iV + M((0*)') + 77*) 

Taking the qnotient of these eqnations we conclnde that we can redehne the seqnence 
Cat so that it still converges to 0 , and so that: 

' p{4>\)\ N + M{{ct>\y) + K^' 


(1 - +v) < GM 




GM 


AT + M((0*)') + 77* 


< (1 + CAr). 


From Lemmas and 10, it follows indnctively that 0o — *^0 — *^1 0o — 

every t. We will now use the fact that 0 q ^ 0i to establish that the numerators above are 

essentially greater than the denominators. We know from Lemma 11 that (0o)^ — (*^ 1 )^ 


implies il7((00') > M((00'). Also, 0o ^ implies that 0i(l)/|0i| < 0o(l)/l</’ol- 


know that (j)\{N)/\(j)\\ < from Lemma 30 (with cat as specihed there). We therefore 
have that: 

Pi.4>i) = l-a0i(l)/l0i|-c0U^)/l07vl > l-a0o(l)/|0o|-c0o(^)/l‘/’7v|-ceA = p(0o)-ce7v. 

Since p(0o) >1 — a — c = 6 >c, it follows that p(0o) — ccat > p(0o)(l — eAr). We can 
therefore redehne the cat so that the sequence still converges to 0 and: 

W + M((0*)') + 77*' 


1 < GM 


A^ + M((0*)') + 77* 


< (1 + Cat). 


Let (5* be the term inside the large parentheses, i.e., d* = > 1 and 

GM((50 < 1 + Cat. Since M((0 q)') < M((r;0') < M((0*0') and M((0 q)') < M{(v\)') < 
M(( 0 *i)'), we have: 

1 aA+MIA;*!'! +/7* . ^ 


^ ^ iV + M((t;*)*) + 77* ^ 
d* - Ar + M((r;0')+ 77* - ' 


Now let us return to the proof of Lemma 
existence is ensured by Lemma 

d* = n 


have that G 


32 


31 


For each i < j, let 6j j be the d* whose 
for the case 77* = l^L = - Wj. Let 

M((5*) < (1 + and that for every t and i 7 ^ j: 

N + m 


N + Wj 


< A. 


Since + = iN — it follows that for some j, N + Wj > N + M^{i — 

l)/7. Therefore, for every i, N + 111* > (Al + M*(7 — l)/7)/d*. A similar argument shows 
that N + IF/ < (77 + M^{i — l)/7)d*, which completes the proof of Lemma 


31 
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5.5. Variants of the model. In this subsection we briefly consider variants of the 
model for which populations may be flnite or inflnite and fitnesses may be additive or 
multiplicative. For the most part, our analysis here will rely on the results of simu¬ 
lations, although we shall also be able to draw some concrete conclusions concerning 
fundamental similarities and differences between the models. First, let us describe these 
variants. 

5.5.1. The finite model. For the flnite model we consider an extra parameter P G N, 
which determines the size of the population. This size is then fixed through the gen¬ 
erations, so that a population always consists of P vectors, Xi, ....,Xp, in Z^. Let us 
consider the sex process first. In order to apply selection one chooses 2P individuals, 
by sampling independently from the population with replacement: if M is the mean 
fitness of the population and F{x) is the fitness of individual x, then the probability 
that individual x is chosen for the nth sample (1 < n < 2P) is F{x)/M. One may 
consider the parent generation as forming a pool of gametes. The probability that a 
gamete chosen uniformly at random from this pool comes from a given individual x, is 
proportional to the fitness of x. During the selection phase we are choosing P many 
pairs of individuals from which gametes are taken (recombination later being applied to 
each of these pairs). To apply the mutation operation, we take in turn each individual 
from the P-many pairs chosen during selection, and for each locus we change its fitness 
value by —1, 0 or 1 with probabilities /i(—1), /i(0) and /i(l) respectively. To apply 
recombination, we take each of the P pairs resulting from mutation in turn. Suppose 
that the nth pair is x\ = and x"^ = x"^ Then we form a;* 

which is the nth member of the next generation by taking each locus i in turn and 
defining either x* ^ ^ or x* ^ = xF^ each with equal probability. The assumption 

of maximum recombination rates might be justified by considering that one is choos¬ 
ing a representative gene from each chromosome, meaning that the monitored genes 
lie on distinct chromosomes. For the asex process, one proceeds similarly, except that 
P many individuals rather than pairs are chosen during the selection phase, and the 
recombination phase is omitted. 

The flnite model is clearly the most important to understand, and the analysis we 
have provided for the inflnite model provides a good approximation for large populations 
and over a number of generations which is not too large. As mentioned previously, 
the equations governing the change in mean fitness and variance due to selection and 
mutation for the inflnite population model would now perfectly describe the expected 
effect of mutation and selection for flnite populations, and the flnite populations model 
could be seen simply as a stochastic approximation to the inflnite case, were it not for 
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the loss in variance and higher cumulants due to sampling. While the effect of sampling 
may not be too significant for large populations on a stage by stage basis, long term it 
will have the effect that the mean fitness no longer tends to infinity over stages. Without 
providing a rigorous proof, one may reason that this is perhaps unsurprising as follows. 
Mutation will still have a fixed expected effect on the mean and variance at each stage. 
For a population cp with M = M{(p), V = V{(p) and = K3{4>), however, while the 
expected effect of selection on the mean is just as for the infinite population model, the 
expected effect of selection on variance is now: 



Now the ratios between cumulants will not tend to increase without limit (in the infinite 
populations model simulations show these ratios converging to fixed values over time, 
and such behaviour is also approximated for the finite model). Thus, if variance was to 
increase without limit, selection would soon produce decreases in variance outweighing 
any increases given by mutation. A similar analysis can be made including the effect of 
recombination, establishing that for sufficiently large variances, the effect of sampling 
will outweigh any other increases in variance. 

5.5.2. The multiplicative model. This model is defined exactly like the additive one 
with the sole difference that the fitness of an individual is calculated multiplicatively, 
i-e., F{X) = nt .]^Xi. For the infinite case, the sex and asex processes now behave 
identically, given populations initially at linkage equilibrium. This was initially observed 
by Maynard-Smith.^ 

Lemma 33. Multiplicative selection preserves linkage equilibrium. 

Proof. Suppose that 0 is a population at linkage equilibrium. Let Xi,..., be random 
variables distributed according to 0, and let Xj',...,X^ be distributed according to 
<p* = Sel(0). We show that selection maintains independence between the hrst two 
loci, as the general result is very similar. We must show that whenever P(Xj‘ = mi) ^ 0 


and P(Xj' = m 2 ) 7 ^ 0 : 


P(X 


2 


n\ X* = mi) = P(X; 


2 


n\ XI = m 2 ). 


This is equivalent to: 


P(X* = mi A X* = n) _ P(X* 


= m 2 A X 2 = n) 


P(X* =mi) 


P(X* = m 2 ) 
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Now, since selection acts according to multiplicative fitnesses: 

P(Xj'=mi) mi P(Xi = mi) 

P(Xj"=m2) m2P(Xi = m2)’ 

Also: 

'P{Xl=mi A X 2 = n) nmiP(Xi = mi A X 2 = n) 

P(Xi=m 2 A X 2 = n) nm 2 P(Xi = m 2 A X 2 = n) ’ 

so the result follows from linkage equilibrium for 4>. □ 

Thus, if a population begins at linkage equilibrium, this linkage equilibrium will be 
preserved throughout all the stages (for the N- and bounded-models). Each application 
of recombination now has no effect on the population. 

For the finite multiplicative model, however, sampling will produce linkage disequi¬ 
librium and sex now robustly outperforms asex (as seen in the outcomes of simulations 
presented in Q. For an insightful analysis of mechanisms which may allow negative 
LD 2 to build up in this context see.^^ 

5.6. The simulations. For a small number of loci i one can implement the algorithms 
described directly. If one wishes to deal with a larger number of loci for the infinite 
population sex process then one can achieve more efficient simulations (for the N and 
bounded models, and which will give only tiny margins of error due to truncation issues 
for the Z model), by making use of Lemmawhich allows one to track the entire pop¬ 
ulation by monitoring each locus separately. Similarly, one can achieve more efficient 
simulations for the asex infinite population process (for the Z-model, and with only tiny 
margins of error for the other domains) by monitoring only the distribution on the total 
htness of individuals. For finite populations, such mechanisms for improving efficiency 
are not generally necessary (or indeed possible). In considering the unbounded infinite 
populations models, of course one can only deal with a bounded domain in practice. 
One is therefore limited in the number of generations which can be simulated. To make 
the computations more precise for the infinite bounded model, we represented real num¬ 
bers by their logarithms, as the values of the probability distribution at the upper and 
lower bounds are extremely small. 
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